1 

SEQUENCE LISTING 

<110> KOBAYASHI, KAZUO 

KITAGAWA, YOSHINORI 
KOMEDA, TOSHIHIRO 
KAWASHIMA, NAGAKO 
JIGAMI, YOSHIFUMI 
CHIBA, YASUNORI 

<120> METHYLOTROPH PRODUCING MAMMALIAN TYPE SUGAR CHAIN 

<130> 081356-0224 

<140> 10/511,436 
<141> 2004-10-25 

<150> PCT/JP03/05464 
<151> 2003-04-28 

<150> JP 2002-127677 
<151> 2002-04-26 

<160> 121 

<170> Patentln Ver. 3.3 

<210> 1 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 1 

Ala Tyr Met Phe Lys Tyr Asp Ser Thr His Gly 
15 10 



<210> 2 
<211> 11 
<212> PRT 

<213 > Saccharomyces cerevisiae 
<400> 2 

Asp Gly Pro Ser His Lys Asp Trp Arg Gly Gly 
1 5 10 



0 IS Mu LMJ 
10/511436 



<210> 3 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PGP5 for amplification of 5 '-region of 
Ogataea minuta GAP gene 



2 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (24) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 

<400> 3 

gcntayatgt tyaartayga ywsnacncay gg 3 2 



<210> 4 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PGP3 for amplification of 3 1 -region of 
Ogataea minuta GAP gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (24) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 



<400> 4 

ccnccnckcc artcyttrtg nswnggnccr tc 



32 



3 



<210> 5 
<211> 3186 
<212> DNA 

<213> Ogataea minuta 



<400> 5 

aagctttact 

cacggaccac 

ctctatctaa 

gcccgagatt 

gtgcagaccc 

atgtgggagc 

ccaatatctc 

ctttcagaca 

cggcagattc 

acaacgcatc 

acgggtgcac 

catatcacag 

cggacagctt 

cacccgtaca 

aaaagagccc 

gcagacctgg 

ggcacggcag 

ggtccgttcg 

tgggtagtgc 

tggttaacac 
tcgggacacg 
gcgactcttg 
cagcgagcga 
gtccactttt 
tacaattatt 
aacgtcggta 
agaaaggaca 
tacatgttca 
aagtaccttg 
atcccatggg 
accgccggcg 
tccgctgacg 
tccattgtct 
aacgacgttt 
caaaagaccg 
aacatcattc 
gctggtaagt 
ttgactgtca 
gcctctgagg 
gacttcttga 
ccaactttcg 
gtcgacttgc 
gcctctttaa 
tgtagactta 
caccggaaaa 
cattgccaaa 
catcaattgt 
ttacttctac 
atccgatgcg 
ttggtgacca 
tctcctgtgt 
gtcagccttg 



ggttcaaggg 
agttgacagc 
tcccaggcta 
cagcagactc 
ggsggatgtg 
tgtgcaaagg 
ggcaacaaga 
gtccacccgg 
gcgtccagtc 
gccacatata 
tgattgctgc 
cggactgcgc 
cttaaaacct 
cctccggagt 
gggacccatc 
ggcagacctg 
tctataccgt 
ccagtctcgt 
gcgcgaacgc 
tggcccggag 
acctaagtgg 
tgcgcgcccg 
tggtgtgcca 
ctccaatagt 
gtttgaattc 
tcaacggatt 
tcaacgtggt 
agtacgactc 
tgatcgacgg 
gtaaggaggg 
ctcaaaagca 
ctccaatgtt 
ccaacgcttc 
tcggtattga 
ttgacggtcc 
catcctccac 
tgactggtat 
acttgaagac 
gtgaacttgc 
ccgacaacag 
tcaagttgat 
ttgagcacgt 
gtaaacattt 
ctccgaagct 
ctggctattt 
ttcgatcgtt 
cctctattcc 
cggcagggat 
gtgggacttc 
aacgagatga 
cgattcgagc 
attaactttc 



gttaagtagg 
atcgactgct 
ctcgatccct 
tcgcagcgca 
aagctgggac 
ggcaatggtc 
acgcccgagg 
attccaatat 
aggccatctg 
cgtcaccgcc 
gttaacgagg 
atctctggcc 
ggctgaaact 
tgtatcctca 
gaccggaaaa 
ggttagtacc 
agaagctctt 
accaccaccg 
ctggcaccca 
tagtcatata 
cgtgcaaagc 
cggcggttcg 
gatctcaggc 
atttgacccg 
actcaattaa 
cggaagaatt 
tgccgtgaat 
cactcacgga 
tcagaagatt 
cgttgacttt 
cattgatgct 
cgttatgggt 
ctgtaccacc 
gtctggtttg 
atcccacaag 
cggtgccgct 
gtctctgaga 
cccaaccacc 
cggtatcttg 
atcttcgatc 
ctcctggtac 
tgccaaggtc 
cgttttgaat 
ccggggcttc 
gaaaaatttc 
ttctaacgga 
tttttggccg 

tggggatgat 

atgcgtccaa 
caattgactg 
ggtaccgtag 
ggagctgcgc 



ggcgcggtct 
catcgaaaac 
gcacaaccta 
gcgtgcgttt 
ggcgggaggg 
actcagcgca 
acgacgggac 
tcgcaggact 
cggcgagctg 
cgcccgctgg 
caactggaga 
tgccggacgc 
aagcgagacc 
gaagcggagt 
gaggggtgga 
agggccgaaa 
ctcgacagca 
catgacccaa 
aacacaccac 
cggagatttt 
tcgggggcta 
cgggagacgg 
cgagtcaaga 
ggttgctctc 
catacacaaa 
ggtagactcg 
gatccattca 
agataccaag 
gaggtgttcc 
gtcattgact 
ggtgccaaga 
gtcaaccaca 
aactgtctgg 
atgaccaccg 
gactggagag 
aaggctgtcg 
gttcctacca 
tacgcagaga 
ggttacactg 
tttgacgcct 
gataacgagt 
tcttccgctt 
atatttcaag 
ggctccctga 
gacgttttgc 
cgccagtcga 
atctcgtttg 
cgggatcgat 
atctgttgga 
cattgaaaag 
gtaggtctgc 
gatccacatc 



ggtctttgtg 
ggtcgcagtg 
cagagtgatc 
taatccctca 
aagtctggag 
gagcgatggt 
tctgaatgcg 
cgcgctcaga 
ctgcgctcgc 
caacctgagg 
tgtcagaggc 
ggtagcgtcc 
tgcgacctgg 
aacctgcagg 
gctagtgggg 
agggtcagag 
gcgagcagaa 
tcagcattga 
ttacgcttcc 
ggcatgattc 
aatgtttccc 
gggaaagaga 
caatatataa 
tgttgatttt 
tacaatacaa 
ttcttagaat 
tcgctgccga 
gtgaagtcac 
aagagagaga 
ccaccggtgt 
aggttatcat 
aggagtacac 
ctccattggc 
tccactctat 
gaggaagaac 
gtaaggtctt 
ccgatgtttc 
tctccgccgc 
aggacgccgt 
ctgccggtat 
acggttactc 
aagtggatag 
ttgaataatg 
atttattttt 
ttgaaactcg 
gttattgtta 
tgctgacggc 
gtcctcaact 
tgatgtgctc 
gttattagct 
tatggaggca 
tctgcaccgc 



gttgtttcta 
cggcaatctg 
cgaccgcact 
aatcaaggct 
tggtgagaga 
ggcgcggggg 
agcacgttgt 
aacgcaaccc 
gggctgcgcc 
tttttccgca 
caagtggagc 
cgtctttttg 
aacgcccgca 
cctacgcaag 
tagccttgga 
gaatcagggt 
actgcacaga 
tgctcccaca 
cgcaccgcgg 
taattccggg 
ggcgctcgcg 
ggggtgaccg 
agagaggatt 
ttctagatca 
aatggcttac 
tgctttgtcc 
gtacgctgct 
cttcgaggga 
ccctgctgac 
gttcaccacc 
cactgctcca 
caaggacttg 
caaggttgtt 
cactgccacc 
cgcttccggt 
gccagctctt 
cgttgttgac 
catcaagaag 
tgtctccact 
cttgttgacc 
caccagagtt 
atgaccaatg 
aaagccttgt 
tacatctctg 
agttgaggag 
tgtcacgtga 
ctccgaacag 
ccagaggctg 
ttctgctttt 
tttttggtct 
tgcgtcataa 
gcggaggcct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 



4 



ttgactgcag cattttaatt aatctcgtaa aataagctct taaacgagat tagcttacgg 3180 
ggatcc 3186 



<210> 6 
<211> 336 
<212> PRT 

<213> Ogataea minuta 
<400> 6 

Met Ala Tyr Asn Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 
15 10 15 

Val Leu Arg lie Ala Leu Ser Arg Lys Asp lie Asn Val Val Ala Val 
20 25 30 

Asn Asp Pro Phe lie Ala Ala Glu Tyr Ala Ala Tyr Met Phe Lys Tyr 
35 40 45 

Asp Ser Thr His Gly Arg Tyr Gin Gly Glu Val Thr Phe Glu Gly Lys 
50 55 60 

Tyr Leu Val lie Asp Gly Gin Lys lie Glu Val Phe Gin Glu Arg Asp 
65 70 75 80 

Pro Ala Asp lie Pro Trp Gly Lys Glu Gly Val Asp Phe Val lie Asp 
85 90 95 

Ser Thr Gly Val Phe Thr Thr Thr Ala Gly Ala Gin Lys His lie Asp 
100 105 110 

Ala Gly Ala Lys Lys Val lie lie Thr Ala Pro Ser Ala Asp Ala Pro 
115 120 125 

Met Phe Val Met Gly Val Asn His Lys Glu Tyr Thr Lys Asp Leu Ser 
130 135 140 

lie Val Ser Asn Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Leu Ala 
145 150 155 160 

Lys Val Val Asn Asp Val Phe Gly lie Glu Ser Gly Leu Met Thr Thr 
165 170 175 

Val His Ser lie Thr Ala Thr Gin Lys Thr Val Asp Gly Pro Ser His 
180 185 190 

Lys Asp Trp Arg Gly Gly Arg Thr Ala Ser Gly Asn lie lie Pro Ser 
195 200 205 

Ser Thr Gly Ala Ala Lys Ala Val Gly Lys Val Leu Pro Ala Leu Ala 
210 215 220 

Gly Lys Leu Thr Gly Met Ser Leu Arg Val Pro Thr Thr Asp Val Ser 
225 230 235 240 

Val Val Asp Leu Thr Val Asn Leu Lys Thr Pro Thr Thr Tyr Ala Glu 
245 250 255 



5 



lie Ser Ala Ala lie Lys Lys Ala Ser Glu Gly Glu Leu Ala Gly lie 
260 265 270 

Leu Gly Tyr Thr Glu Asp Ala Val Val Ser Thr Asp Phe Leu Thr Asp 
275 280 285 

Asn Arg Ser Ser lie Phe Asp Ala Ser Ala Gly lie Leu Leu Thr Pro 
290 295 300 

Thr Phe Val Lys Leu lie Ser Trp Tyr Asp Asn Glu Tyr Gly Tyr Ser 
305 310 315 320 



Thr Arg Val Val Asp Leu Leu Glu His Val Ala Lys Val Ser Ser Ala 
325 330 335 



<210> 7 

<211> 1491 

<212> DNA 

<213> Ogataea minuta 



<400> 7 

aagctttact 

cacggaccac 

ctctatctaa 

gcccgagatt 

gtgcagaccc 

atgtgggagc 

ccaatatctc 

ctttcagaca 

cggcagattc 

acaacgcatc 

acgggtgcac 

catatcacag 

cggacagctt 

cacccgtaca 

aaaagagccc 

gcagacctgg 

ggcacggcag 

ggtccgttcg 

tgggtagtgc 

tggttaacac 
tcgggacacg 
gcgactcttg 
cagcgagcga 
gtccactttt 
tacaattatt 



ggttcaaggg 
agttgacagc 
tcccaggcta 
cagcagactc 
ggaggatgtg 
tgtgcaaagg 
ggcaacaaga 
gtccacccgg 
gcgtccagtc 
gccacatata 
tgattgctgc 
cggactgcgc 
cttaaaacct 
cctccggagt 
gggacccatc 
ggcagacctg 
tctataccgt 
ccagtctcgt 
gcgcgaacgc 
tggcccggag 
acctaagtgg 
tgcgcgcccg 
tggtgtgcca 
ctccaatagt 
gtttgaattc 



gttaagtagg 
atcgactgct 
ctcgatccct 
tcgcagcgca 
aagctgggac 
ggcaatggtc 
acgcccgagg 
attccaatat 
aggccatctg 
cgtcaccgcc 
gttaacgagg 
atctctggcc 
ggctgaaact 
tgtatcctca 
gaccggaaaa 
ggttagtacc 
agaagctctt 
accaccaccg 
ctggcaccca 
tagtcatata 
cgtgcaaagc 
cggcggttcg 
gatctcaggc 
atttgacccg 
actcaattaa 



ggcgcggtct 
catcgaaaac 
gcacaaccta 
gcgtgcgttt 
ggcgggaggg 
actcagcgca 
acgacgggac 
tcgcaggact 
cggcgagctg 
cgcccgctgg 
caactggaga 
tgccggacgc 
aagcgagacc 
gaagcggagt 
gaggggtgga 
agggccgaaa 
ctcgacagca 
catgacccaa 
aacacaccac 
cggagatttt 
tcgggggcta 
cgggagacgg 
cgagtcaaga 

ggttgctctc 

catacacaaa 



ggtctttgtg 
ggtcgcagtg 
cagagtgatc 
taatccctca 
aagtctggag 
gagcgatggt 
tctgaatgcg 
cgcgctcaga 
ctgcgctcgc 
caacctgagg 
tgtcagaggc 
ggtagcgtcc 
tgcgacctgg 
aacctgcagg 
gctagtgggg 
agggtcagag 
gcgagcagaa 
tcagcattga 
ttacgcttcc 
ggcatgattc 
aatgtttccc 
gggaaagaga 
caatatataa 
tgttgatttt 
tacaatacaa 



gttgtttcta 
cggcaatctg 
cgaccgcact 
aatcaaggct 

tggtgagaga 

ggcgcggggg 
agcacgttgt 
aacgcaaccc 
gggctgcgcc 
tttttccgca 
caagtggagc 
cgtctttttg 
aacgcccgca 
cctacgcaag 
tagccttgga 
gaatcagggt 
actgcacaga 
tgctcccaca 
cgcaccgcgg 
taattccggg 
ggcgctcgcg 
ggggtgaccg 
agagaggatt 
ttctagatca 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1491 



<210> 8 
<211> 524 
<212> DNA 

<213> Ogataea minuta 
<400> 8 

gtggatagat gaccaatggc ctctttaagt aaacatttcg ttttgaatat atttcaagtt 60 
gaataatgaa agccttgttg tagacttact ccgaagctcc ggggcttcgg ctccctgaat 120 
ttatttttta catctctgca ccggaaaact ggctatttga aaaatttcga cgttttgctt 180 



J 



gaaactcgag 
tattgttatg 
ctgacggcct 
cctcaactcc 
atgtgctctt 
tattagcttt 



ttgaggagca 
tcacgtgaca 
ccgaacagtt 
agaggctgat 
ctgctttttt 
tttggtcttc 



ttgccaaatt 
tcaattgtcc 
acttctaccg 
ccgatgcggt 
ggtgaccaaa 
tcctgtgtcg 



cgatcgtttt 
tctattcctt 
gcagggattg 
gggacttcat 
cgagatgaca 
attcgagcgg 



ctaacggacg 
tttggccgat 
gggatgatcg 
gcgtccaaat 
attgactgca 
tacc 



ccagtcgagt 240 
ctcgtttgtg 300 
ggatcgatgt 360 
ctgttggatg 420 
ttgaaaaggt 4 80 
524 



<210> 9 
<211> 113 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer for production of an expression cassette 
with GAP gene promoter and terminator from 
Ogataea minuta 

<400> 9 

gtttgaattc actcaattaa catacacaaa tacaatacaa agtcgacaaa aaatgcatgt 60 
ggatagatga ccaatggcct ctttaagtaa acatttcgtt ttgaatatat ttc 113 



<210> 10 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer for production of an expression cassette 
with GAP gene promoter and terminator from 
Ogataea minuta 

<400> 10 

tttttactag tacggtaccg ctcgaatcga cacaggag 38 



<210> 11 
<211> 12 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 11 

Gly Pro Tyr lie Cys Leu Val Lys Thr His lie Asp 
15 10 



<210> 12 

<211> 11 

<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 12 

Gly Arg Gly Leu Phe Gly Lys Gly Arg Asp Pro 
15 10 
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<210> 13 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PURS for amplification of 5' -region of 
Ogataea Minuta URA3 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 

<400> 13 

ggnccntaya thtgyytngt naaracncay athga 3 5 



<210> 14 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PUR3 for amplification of 3 1 -region of 
Ogataea Minuta URA3 gene 

<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
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<220> 

<221> modif ied_base 
<222> (9) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (24) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 

<220> , 

<221> modif ied_base 

<222> (30) 

<223> a, c, g or t 

<400> 14 

ggrtcncknc cyttnccraa narnccnckn cc 3 2 

<210> 15 
<211> 3113 
<212> DNA 

<213> Ogataea minuta 
<400> 15 

gccgcggccg ctgctgctgc ttccactaaa acagcaacga gcaacgcgtc tgccgaaaac 60 
tccttgaatc aagacctgga caatcatttg acgttgggca gagagcattt cgacaccact 120 
gaggttccta ccgcggacgg gtccaaagtg gaggttctcc gaaacatgtc tgtcgagacg 18 0 
ggtcctgccg acgatcttaa caaaaacccc tccaccagcg agctggtgca tctggaggaa 24 0 
aaatcacagg aaagcgcatc cgaggaagag gtcaggacct cgaaccatgc cgacacagcc 3 00 
ggaacagaac caggtccaga acacgtccat ggcaacgata aagcggaggg cgagggcgag 360 
tcctcagaag atgaccagga aatggtggac gctccactgc ctccttcgga cgataaggag 420 
actgagaacg cgctgccgac ggagactaaa gtggagtcga ccaaagacga tgtagaccag 480 
gaagaagagg aagaagagga ggaagaggaa gaaacagtac ctttccaagt ctctaaaaag 54 0 
gtatccaagg aggaagaaat ttcagctccg acgccggagc ccactacgcc tacgtcggcg 600 
aacgagagcg aggaggaagg cgataccagg ccccggaaaa ggcggcggtc ggagtcgatt 66 0 
tcggccgcct ccagcaagag atttttggct cttggtactc aactgttgag ccaagtttcg 720 
tcgaatcggt ttgcgtcgat gtttttgcag ccagtgaaca aaaacgagga gcctgagtat 780 
tacaagctca tccaccagcc gatcgatctc aagacgctgt cgaagtcggt ccgaaccggc 840 
gagattcagt cgttcgatga ccttgagttc cagctgcaac tcatgttcag caatgcaatc 900 
atgtacaacg acacctacca gacggaaacg tacaaatgga cgatcgagat gatggaggaa 960 
gcccagaatc tgattgaaat gttcagggaa acttccaaca actgagatca actgcgacta 1020 
cttctgttgg ctggctggac gggttgtatt actatcttgg acaacgctat gtaaccttat 1080 
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ctaaatacaa 
gccgaaatca 
gctcctggaa 
tcaaacaggt 
tgaaacatct 
atcaaaggct 
tgcttctgct 
ttgctgtttt 
cttatcgtgg 
gatttgaatt 
gagactgcag 
actaagacat 
aacttgatgg 
gaccttttgg 
gacattgtgg 
aaacacaact 
ctccagtaca 
ggagtgaccg 
gagccaagag 
gagtacacca 
attgcgcaaa 

ggtgtcggac 

gttatgaaga 
gaccctgtcg 
gtcgcatgat 
ggagtttgtc 
acaccagcag 
tgtagaagaa 
aagtttcgag 
cctcgttgta 
acggaaattg 
tcagccggag 
ataacagacg 
cctggttttt 



gaattcatgt 
cccggctgct 
tctagatccg 
ctcctctctg 
gtcgacaaga 
cgacgtcatt 
cgacgatttt 
tctagttctg 
agaagggtgt 
tttttttacc 
ttggtctttg 
acgcgcaaag 
aatccaagaa 
agctgttgga 
aagacttttc 
tcatgatctt 
agggaggagt 
gctcgcgaat 
gtctgctcat 
aaaagacggt 
acgacatggg 
tcgacgacac 
cgggaactga 
tggaaggcga 
ttcgggtcac 
gccaaactct 
cccgatggag 
gcccagtagg 
gtatgaattg 
cagatcaagc 
gattgccagc 
agctttgatc 
ggctttccag 
cacaaactca 



acaaaatcat 
cagttaccac 
actctctcac 
gagctgagct 
ttggtgtaca 
ttgctgtctc 
ctggaattcc 
ctttgctaga 
ttttttgcta 
cggcactttt 
cggggagatt 
ggcggcggct 
gacgaacttg 
caagctggga 
gtacgaaaac 
cgaggaccga 
ttaccaaatc 
tgtctcgggt 
gctggctgag 
tgaaatcgca 
tggccgcgat 
cggtgacgct 
catcataatc 
aagatacaga 
gtgactatat 
tcggccaact 
gtagccggta 
gagagatggg 
tagtagagct 
gactccagtc 
agggttgtgg 
gatttcgagt 
tactggtacc 
aacacggaga 



ttgtgcgggc 
ctctcatttg 
tgtgcttgag 
tgttggaggt 
actggggcaa 
tagtaactta 
tgttgttctt 
ctgttggcca 
ccaaaagctg 
tgaccccata 
ttggcagata 
catccgtcgc 
tgtgcctcgg 
ccgttcattt 
accgtggtgc 
aaatttgccg 
gcaaagtggg 
ctcagacagg 
ctgtcgtctg 
aagtccgaca 
gagggcttcg 
ctgggccagc 
gtgggcaggg 
aaggctggat 
agctattggt 
cgatgctcag 
gacttgttgt 
acttgcggta 
taaagaaacg 
tggacagatg 
cggcactggc 
gtttgtccag 
agtttatcat 
agtagaacgc 



gcagagacga 
gttcatgagc 
gaacttctca 
gaagtcattg 
aataatgtta 
ccctcatgaa 
tgtagttgag 
atatctggtt 
aaaattctga 
ctagttgtac 
aacaggcgac 
ctgtggccag 
tcgatctcac 
gtctggtcaa 
cgctgctgaa 
atataggcaa 
ccgatatcac 
ctgcccagga 
aaggctcgct 
gagattttgt 
actggctcat 
agtaccgcac 
gactgttcgg 
gggacgctta 
atgtacaaga 
tttctggcgt 
tgcagttctc 
tctggtcatc 
gcttctctct 
gaccttctgg 
tccagccaaa 
gtccgggttc 
gctctgagtt 
aaacccaaag 



gcgacgagtt 
atttgattct 
gcacacttgt 
acacagtcgt 
gtcgtggttc 
gtcgttcatc 
ttgatccatt 
atccctctag 
aaaattttcg 
caaactgaaa 
tatgtcctcg 
aagactgctg 
ctctacaaag 
gacacacatc 
actggccaag 
caccgtcaaa 
caacgcccac 
gaccaccgac 
cgcgtacgga 
gatcggtttc 
catgacccca 
ggtcagcgcc 
caagggaaga 
tttgagtcgt 
attaattagc 
gaaatttcga 
gcgaatcccc 
atgatttcga 
agatggtggg 
attttgttga 
agaatgaagg 
ttctctccgt 
ctgtggaagc 
ctt 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3113 



<210> 16 
<211> 265 
<212> PRT 

<213> Ogataea minuta 
<400> 16 

Met Ser Ser Thr Lys Thr Tyr Ala Gin Arg Ala Ala Ala His Pro Ser 
15 10 15 

Pro Val Ala Arg Arg Leu Leu Asn Leu Met Glu Ser Lys Lys Thr Asn 
2 0 25 3 0 

Leu Cys Ala Ser Val Asp Leu Thr Ser Thr Lys Asp Leu Leu Glu Leu 
35 40 45 

Leu Asp Lys Leu Gly Pro Phe lie Cys Leu Val Lys Thr His lie Asp 
50 55 60 

lie Val Glu Asp Phe Ser Tyr Glu Asn Thr Val Val Pro Leu Leu Lys 
65 70 75 80 



10 



Leu Ala Lys Lys His Asn Phe Met lie Phe Glu Asp Arg Lys Phe Ala 
85 90 95 

Asp lie Gly Asn Thr Val Lys Leu Gin Tyr Lys Gly Gly Val Tyr Gin 
100 105 110 

lie Ala Lys Trp Ala Asp lie Thr Asn Ala His Gly Val Thr Gly Ser 
115 120 125 

Arg lie Val Ser Gly Leu Arg Gin Ala Ala Gin Glu Thr Thr Asp Glu 
130 135 140 

Pro Arg Gly Leu Leu Met Leu Ala Glu Leu Ser Ser Glu Gly Ser Leu 
145 150 155 160 

Ala Tyr Gly Glu Tyr Thr Lys Lys Thr Val Glu lie Ala Lys Ser Asp 
165 170 175 

Arg Asp Phe Val lie Gly Phe lie Ala Gin Asn Asp Met Gly Gly Arg 
180 185 190 

Asp Glu Gly Phe Asp Trp Leu lie Met Thr Pro Gly Val Gly Leu Asp 
195 200 205 

Asp Thr Gly Asp Ala Leu Gly Gin Gin Tyr Arg Thr Val Ser Ala Val 
210 215 220 

Met Lys Thr Gly Thr Asp lie lie lie Val Gly Arg Gly Leu Phe Gly 
225 230 235 240 

Lys Gly Arg Asp Pro Val Val Glu Gly Glu Arg Tyr Arg Lys Ala Gly 
245 250 255 

Trp Asp Ala Tyr Leu Ser Arg Val Ala 
260 265 



<210> 17 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer for amplification of a gene fragment 
conferring resistance against chloramphenicol 

<400> 17 

atggagaaaa aaactagtgg atataccacc 



<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primer for amplification of a gene fragment 
conferring resistance against chloramphenicol 

<400> 18 

ctgagacgaa aaagatatct caataaaccc 3 0 



<210> 19 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DU5 used for confirmation of destruction 
of Ogataea minuta URA3 gene 

<400> 19 

aggaagaaga ggaggaagag gaagaaac 2 8 



<210> 20 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DUC5 used for confirmation of destruction 
of Ogataea minuta URA3 gene 

<400> 20 

cgatgccatt gggatatatc aacggtgg 28 



<210> 21 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DU3 used for confirmation of destruction 
of Ogataea minuta URA3 gene 

<400> 21 

ccgtgtttga gtttgtgaaa aaccagggc 29 



<210> 22 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DUC3 used for confirmation of destruction 
of Ogataea minuta URA3 gene 

<400> 22 

tgtggcgtgt tacggtgaaa acctggcc 



<210> 23 
<211> 14 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 23 

Phe Val Ala Thr Asp Arg lie Ser Ala Tyr Asp Val lie Met 
1 5 10 



<210> 24 
<211> 14 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 24 

Gin Asp Ser Tyr Asp Lys Gin Phe Leu Arg Asp Trp Leu Thr 
15 10 



<210> 25 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PADS for amplification of 5 1 -region of 
Ogataea minuta ADE1 gene 

<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (9) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 



<220> 

<221> modif ied_base 

<222> (24) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (36) 

<223> a, c, g or t 

<400> 25 

ttygtngcna cngaymgnat hwsngcntay gaygtnatha tg 

<210> 26 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PAD 3 for amplification of 3 1 -region of 
Ogataea minuta ADE1 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, gort 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (33) 

<223> a, c, g or t 

<400> 26 

gtnarccart cncknarraa ytgyttrtcr tanswrtcyt g 



<210> 27 

<211> 2560 

<212> DNA 

<213> Ogataea minuta 
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<400> 27 

gatatcccaa 

cgtttcctga 

tgcgtgtggg 

acgcagtcac 

actttgtgag 

cgttgacgag 

caggcatcac 

acgtgacatt 

tcagcgtggc 

agaggaaaag 

agaaatctt t 

tgtgaagagg 

gtttttctga 

gcgtcgaaaa 

tgtaatttaa 

acagatcagc 

tcgacggcat 

aggaaagcct 

atggaatcaa 

tgaaagacac 

ttccacagga 

tgagaaagca 

catggaagga 

aggagagtca 

aacacgacga 

cgcggctcgc 

agggtatcat 

ttcttgtgga 

agatcggaca 

acggtctgaa 

cgaagtacgt 

tagtaaccct 

attgaattta 

gacggcattt 

gtgaagtttg 

gaacaccagt 

ggttcctttg 

aaggagatct 

cgtattctca 

caggaggaac 

ttgatcagga 

ttcacccagg 

caagaacttt 



gaacctatgc 
gtttcctcca 
cacgaacgag 
catcaattcc 
ataagctctg 
atagttgagc 
aagatccacc 
taacacaaga 
aaaatcgatg 
gaccactgaa 
atgcgtagtc 
attcgataaa 
ccttttttgc 
gagtcaagcc 
accatcacag 
taaccaggac 
cttgccgcta 
gctgttcgtg 
agacaagggt 
tatcaagaac 
gctgtctcag 
caaattgatc 
gtacaacaag 
agagttcccc 
aaacatttcc 
agaaaaggct 
tctcgccgac 
cgaggttttg 
gtcgcaggac 
cgggaaagac 
cgaggcattt 
ggaaatttga 
gtcgcttccc 
gagccatccc 
acaacaggct 
gcgccatgtc 
gatgtgttcg 
cctacatcgc 
gagaactaaa 
agaccgtcca 
agtacaaagg 
ttctcaacgc 
tccagtccac 



cgagggttca 
acggccagga 
cccacgtaga 
agaaagttct 
ctggcaagaa 
tttgttccgt 
aacgtcagag 
tcccgccagt 
cttgcgcctt 
gggagaaata 
ttggggttca 
gagaggggaa 
tctttctcgt 
gcgatcgcga 
tatataagta 
attccactgg 
attgccaagg 
gcaacagacc 
aaaatactga 
caccttatcg 
ccaaagtaca 
cccctggagg 
agcaagaccg 
gttccgattt 
cccgagaaag 
gtgcagctgt 
acaaagtttg 
acccctgatt 
tcttacgaca 
ggtgtctcta 
gagtctctga 
tatgtggcgg 
aaaacagcaa 
agggtctggc 
acagctgcca 
catcgccaac 
gaaagtgcgc 
catgaacacc 
gcatcccaac 
tctgtacatg 
aaagaacgag 
tctctatcaa 
tccagagatt 



gctcacggcc 
ttatctcgtg 
caaacaggct 
cgtggatgaa 
ttcccacgag 
ctcgtaacag 
tgaagaacca 
tgccgactat 
caaccacata 
ctgacaaaac 
attcagacat 
cagcaccgga 
ttagctcgcg 
ttaaaaatga 
ggcgggaagc 
agccaacaat 
gcaaagtcag 
ggatttccgc 
ctcagctgtc 
catccactga 
agtcgcagct 
tgattgtcag 
tgcacggtct 
tcaccccgtc 
ctgcagagat 
actccaaggc 
agtttggaat 
cctcgagatt 
aacagtttct 
tgaccgcgga 
cgggaagaaa 
tgtagtctgt 
tttgtcaaca 
agttacaggg 
agtcgcaact 
gagttccagc 
cgcaagtcgg 
aaagagaagc 
attgtccatt 
gaatactgcg 
tttatcccgg 
tgccactatg 
gcaccccggg 



gataaaccaa 
agttcccaga 
caaagccaac 
cgacagctca 
aagagtgctc 
gactccctct 
caccaggtaa 
cttggactcg 
ctcctcatta 
gaccgctccc 
ggtagattgg 
gatagttctt 
tacagtcgac 
atccggagaa 
gcacaatttc 
gtcactcaca 
agacatctat 
ctacgatgtg 
agtattctgg 
cgacgaagtg 
gagtggaaga 
aggctacatc 
cgaggttggc 
aacgaaagct 
tgtcggggaa 
cagaacttac 
tgacgagaac 
ttgggacgca 
gagagactgg 
gatcgctgaa 
gtggacgtag 
ggcggtggaa 
cttagtctgt 
ctttgatcaa 
tgggtagtag 
ccttggagct 
acggcaagat 
agcagctcac 
atgtccacca 
atgggggcga 
agaacttgat 
gggtcaatat 



tcaaagacaa 
ccgttcggct 
gaaaactcgt 
ggaaggttga 
aattctttcc 
ttatggtagc 
accttccagc 
aaaagcgttt 
cagcaaaagt 
ggtgtcccgc 
tgagggtaat 
agatcaaaat 
gcgtcggttt 
gtcaaaaata 
taggcattcc 
acaaccaacc 
caagttgacg 
atcatggaga 
tttgatttgc 
tttgccagac 
gcactggtgg 
accggaagtg 
gcagagctga 
gaacaaggcg 
caactgtgtg 
gccaaaagca 
gacgaattgg 
aagacttaca 
ctcacgtcca 
cgcacgggtg 
tttttgataa 
taaaatctaa 
gcacagcctt 
aagaaaactg 
ctcattcgtc 
tattggtagg 
atttgtgaga 
agcagagttt 
cgaccacgtc 
cttgtcggtg 
ctggcaaatc 
tgaggctgtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2560 



<210> 28 
<211> 303 
<212> PRT 

<213> Ogataea minuta 
<400> 28 

Met Ser Leu Thr Thr Thr Asn Leu Asp Gly lie Leu Pro Leu lie Ala 
15 10 15 

Lys Gly Lys Val Arg Asp lie Tyr Gin Val Asp Glu Glu Ser Leu Leu 
20 25 30 
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Phe Val Ala Thr Asp Arg lie Ser Ala Tyr Asp Val lie Met Glu Asn 
35 40 45 

Gly lie Lys Asp Lys Gly Lys lie Leu Thr Gin Leu Ser Val Phe Trp 
50 55 60 

Phe Asp Leu Leu Lys Asp Thr lie Lys Asn His Leu He Ala Ser Thr 
65 70 75 80 

Asp Asp Glu Val Phe Ala Arg Leu Pro Gin Glu Leu Ser Gin Pro Lys 
85 ,90 95 

Tyr Lys Ser Gin Leu Ser Gly Arg Ala Leu Val Val Arg Lys His Lys 
100 105 110 

Leu He Pro Leu Glu Val He Val Arg Gly Tyr He Thr Gly Ser Ala 
115 120 125 

Trp Lys Glu Tyr Asn Lys Ser Lys Thr Val His Gly Leu Glu Val Gly 
130 135 140 

Ala Glu Leu Lys Glu Ser Gin Glu Phe Pro Val Pro He Phe Thr Pro 
145 150 155 160 

Ser Thr Lys Ala Glu Gin Gly Glu His Asp Glu Asn He Ser Pro Glu 
165 170 175 

Lys Ala Ala Glu He Val Gly Glu Gin Leu Cys Ala Arg Leu Ala Glu 
180 185 190 

Lys Ala Val Gin Leu Tyr Ser Lys Ala Arg Thr Tyr Ala Lys Ser Lys 
195 200 205 

Gly He He Leu Ala Asp Thr Lys Phe Glu Phe Gly He Asp Glu Asn 
210 215 220 

Asp Glu Leu Val Leu Val Asp Glu Val Leu Thr Pro Asp Ser Ser Arg 
225 230 235 240 

Phe Trp Asp Ala Lys Thr Tyr Lys He Gly Gin Ser Gin Asp Ser Tyr 
245 250 255 

Asp Lys Gin Phe Leu Arg Asp Trp Leu Thr Ser Asn Gly Leu Asn Gly 
260 265 270 

Lys Asp Gly Val Ser Met Thr Ala Glu He Ala Glu Arg Thr Gly Ala 
275 280 285 

Lys Tyr Val Glu Ala Phe Glu Ser Leu Thr Gly Arg Lys Trp Thr 
290 295 300 



<210> 29 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
5 1 -primer for amplification of upstream region 
of URA3 structural gene 

<400> 29 

ccccgagctc aaaaaaaagg taccaatttc agctccgacg ccggagccca ctacgcctac 60 



<210> 30 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
3 1 -primer for amplification of upstream region 
of URA3 structural gene 

<400> 30 

gggaagcttc cccagttgta caccaatctt gtcgacag 3 8 



<210> 31 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer Dadl-5 used for destruction of Ogataea 
minuta ADE1 gene 

<400> 31 

aaaaagcggc cgctcccggt gtcccgcaga aatctttatg cgtagtcttg 50 



<210> 32 
<211> 56 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer Dadl-3 used for destruction of Ogataea 
minuta ADE1 gene 

<400> 32 

cccccggatc ctttttttta agcttgttgt actccttcca tgcacttccg gtgatg 56 



<210> 33 

<211> 59 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primer Dad2-5 used for destruction of Ogataea 
minuta ADE1 gene 

<400> 33 

ttttcacccc gtcaaggatc cctgaacaag gcgaacacga cgaaaacatt tcccccgag 59 



<210> 34 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer Dad2-3 used for destruction of Ogataea 
minuta ADE1 gene 

<400> 34 

tttttgggcc cacctgggtg aagatttgcc agatcaagtt ctcc 44 



<210> 35 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DA5 used for confirmation of destruction 
of Ogataea minuta ADE1 gene 

<400> 35 

gatgcttgcg ccttcaacca catactcctc 30 



<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DA3 used for confirmation of destruction 
of Ogataea minuta ADE1 gene 

<400> 36 

aaaagttctt gcacagcctc aatattgacc 30 



<210> 37 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DOU5 used for confirmation of destruction 
of Ogataea minuta ADE1 gene 

<400> 37 

atcgatttcg agtgtttgtc caggtccggg 



<210> 38 
<211> 10 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> M0D_RES 

<222> (3) 

<223> His or Arg 

<220> 

<221> MOD_RES 

<222> (4) 

<223> He or Val 

<400> 38 

Pro Gin Xaa Xaa Trp Gin Thr Trp Lys Val 
15 10 



<210> 39 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 39 

Trp Tyr Ala Arg Arg He Gin Phe Cys Gin Trp 
15 10 



<210> 40 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer P0H5 for amplification of 5 1 -region of 
Ogataea minuta OCH1 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 
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<400> 40 

ccncarcryr thtggcarac ntggaargt 



29 



<210> 41 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer P0H3 for amplification of 3 1 -region of 
Ogataea minuta OCH1 gene 

<220> 

<221> modif ied_base 

<222> (19) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (22) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (25) 

<223> a, c, g or t 



<400> 41 

ccaytgrcar aaytgdatnc knckngcrta cca 



33 



<210> 42 

<211> 2527 

<212> DNA 

<213> Ogataea minuta 



<400> 42 

agatctgttg 

ccggcacaca 

attgttacct 

tctcaaggtg 

atggatctgc 

attcaggcac 

gaagtgactg 

tcgagccgtt 

tagtcatctc 

cggcagtcgt 

gtggtcattt 

gagtcgattc 

ggaatggatc 

ctatcgtttt 

tggaaagtgg 

tggccacaac 

ttcatgagga 

aaaaatatcc 



acactggtca 
aggacaataa 
caataagttg 
gtgtacgggt 
agggggctcg 
aacggtacca 
aagagacgag 
tttttaggga 
tacgttcagc 
tgatgcgaaa 
taacgatgta 
caggactcac 
tgcagaagcg 
acttccccta 
acatcaacga 
taaacagcgg 
gtctttttgc 
tcaaggcgga 



agcgtgtagc 
gggcgtccgg 
ctctacgatt 
gtttcatttt 
gctgattgca 
acggggcagg 
aaggagacga 
cagaaaccgt 
acacaccatg 
ggcgcgaaag 
cgttgtgtcg 
catgaaagag 
gaacgagcta 
tgaccctgaa 
caaatcattc 
gtacacgtac 
caatgtccct 
ttttttccgg 



caagagaata 
ggctgtcgaa 
gtttccgtct 
taatttgcat 
ctgaaatttc 
cttgatcgcg 
atcagcctac 
tctggatatt 
aactatcacg 
ttcgctgaga 
cgtctggcgt 
tcagagttag 
ccggccgcaa 
aaaccagtgc 
ccgagacact 
catctcattc 
gaggttattg 
tatttggtga 



ggaaacggaa 
attgtcgaga 
ttgacaaagc 
cgagaacgcg 
agcaataaat 
aagcagcagg 
ccctggaacc 
tattcgacgc 
acttgtacga 
tgaacaagaa 
cggttggaag 
aggtgaattt 
gtgcaacgct 
ccaaccaaat 
tccgtaagtt 
cagacagtat 
cagcctacaa 
tttttgcgcg 



tttcatactg 
ccgtagagct 
agtaggcctt 
tagtgcgcca 
agctgaggat 
agaaggcagc 
ataaacaaag 
agagactcgg 
tgatagcaaa 
gttggtggtg 
cacgaaacag 
taaaacattt 
gagagaaaaa 
atggcagacg 
ccaagagaca 
tgtggacgag 
catgttaccg 
cggtggaact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



\ 
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tattcggata 
caaactgtca 
gacccctcca 
agacccgact 
ggcaagcaag 
cgcaaagagg 
atgcagtgga 
gtgagtgacg 
gccaagtaca 
aagcaaacta 
gtgatgatcc 
cccgaccacc 
gagaagatgt 
acttttgcgg 
gcttggacgg 
aacgagagag 
acgactgcgt 
aagatgcaac 
cgatgactat 
atttgagggg 
tggctgattg 
aacaggagat 
ttcgagagtt 
aagaggagaa 
tactagt 



tcgacacgat 
tttcgcacta 
cgcgcaaaac 
ggcacgaatg 
gccatcccat 
ccatgggcaa 
caggacccgg 
gaaagctggg 
agctgtctca 
tcagctggaa 
tgccaatcac 
cgctcgcatt 
gactgcatat 
ggaagtggtg 
aagaacgggt 
tagcttgcag 
cttttctttc 
aaggtatctt 
gacagacagg 
cacgcactca 
gagagagtgg 
caattgtctt 
ttcgaaagag 
gggaggccac 



ctgcctcaaa 
tctcaagacc 
accgatcgga 
gtacgctaga 
gctgcgcgag 
tttgaaaaaa 
ggttttcaca 
agacggttac 
cattgaggtg 
gtccatgagt 
tagcttcagc 
tgtccggcac 
aggaacgcat 
taagggtgtt 
gcacgccgca 
attgcaatcc 
tggtgtacgg 
ctgtagctaa 
tttttggagg 
cggcttttca 
acagatgatt 
gttcttaact 
ctgggattcg 
caagaggacg 



ccagtgaacg 
aacggtaaaa 
ctcaccattg 
cgtattcagt 
ttgatcatcc 
gtagagggga 
gataccctgt 
ggagtcgggt 
gatgccaaca 
aagctatcgg 
cccggcgtgg 
atgttccagg 
tttatacagt 
tgacgagggc 
tggggctgtt 
cgactgagtc 
gtgtcaatac 
accccacttc 
attacaagaa 
cgacggcggg 
tgggtcattc 
accttatcta 
attttatcgt 
gagagtacac 



aatgggccac 
cctcgcagtt 
gaatagaggc 
tctgtcaatg 
gtatagtgga 
aggatatggg 
ttgattatct 
ccaagtactg 
acgagccgat 
agcccctgat 
gccagatggg 
gcagctggaa 
agatcaagtt 
ctgaacccgt 
cgttcagttt 
cacccggttg 
attttcggtt 
tccagacacc 
gtttctcccc 
cggggccgat 
gcaggagtat 
cgaaggatat 
ggaggagttg 
gaccatgtca 



gtttaacgaa 
gccagaagtg 
cgacccagac 
gacgatccag 
gcaaactttc 
tggtgacatc 
caataacgtg 
gaacagtcac 
gcactctgac 
tatagatgac 
ctcgcattcg 
accagatgca 
aaaagtttga 
gagtcaacgc 
tgacgctgct 
agctagtcac 
taaaaacgat 
ttccaccagc 
aaagcgcacg 
gcggggagtt 
tacgagctga 
gttggtgctg 
gaaggaattg 
gacactgacg 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2527 



<210> 43 

<211> 434 

<212> PRT 

<213> Ogataea minuta 



<400> 43 
Met Asn Tyr His 
1 

Arg Lys Ala Arg 
20 

Val He Leu Thr 
35 

Thr Lys Gin Glu 
50 

Glu Val Asn Phe 
65 

Leu Pro Ala Ala 



Pro Tyr Asp Pro 
100 

Lys Val Asp lie 
115 



Asp Leu Tyr Asp 
5 

Lys Phe Ala Glu 



Met Tyr Val Val 
40 

Ser He Pro Gly 
55 

Lys Thr Phe Gly 
70 

Ser Ala Thr Leu 
85 

Glu Lys Pro Val 



Asn Asp Lys Ser 
120 



Asp Ser Lys Arg 
10 

Met Asn Lys Lys 
25 

Ser Arg Leu Ala 



Leu Thr Met Lys 
60 

Met Asp Leu Gin 
75 

Arg Glu Lys Leu 
90 

Pro Asn Gin He 
105 

Phe Pro Arg His 



Gin Ser Leu Met 
15 

Leu Val Val Val 
30 

Ser Val Gly Ser 
45 

Glu Ser Glu Leu 



Lys Arg Asn Glu 
80 

Ser Phe Tyr Phe 
95 

Trp Gin Thr Trp 
110 

Phe Arg Lys Phe 
125 
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Gin Glu Thr Trp Pro Gin Leu Asn Ser Gly Tyr Thr Tyr His Leu lie 
130 135 140 

Pro Asp Ser lie Val Asp Glu Phe Met Arg Ser Leu Phe Ala Asn Val 
145 150 155 160 

Pro Glu Val lie Ala Ala Tyr Asn Met Leu Pro Lys Asn lie Leu Lys 
165 170 175 

Ala Asp Phe Phe Arg Tyr Leu Val lie Phe Ala Arg Gly Gly Thr Tyr 
180 185 190 

Ser Asp lie Asp Thr lie Cys Leu Lys Pro Val Asn Glu Trp Ala Thr 
195 200 205 

Phe Asn Glu Gin Thr Val lie Ser His Tyr Leu Lys Thr Asn Gly Lys 
210 215 220 

Thr Ser Gin Leu Pro Glu Val Asp Pro Ser Thr Arg Lys Thr Pro lie 
225 230 235 240 

Gly Leu Thr lie Gly lie Glu Ala Asp Pro Asp Arg Pro Asp Trp His 
245 250 255 

Glu Trp Tyr Ala Arg Arg lie Gin Phe Cys Gin Trp Thr lie Gin Gly 
260 265 270 

Lys Gin Gly His Pro Met Leu Arg Glu Leu lie lie Arg lie Val Glu 
275 280 285 

Gin Thr Phe Arg Lys Glu Ala Met Gly Asn Leu Lys Lys Val Glu Gly 
290 295 300 

Lys Asp Met Gly Gly Asp lie Met Gin Trp Thr Gly Pro Gly Val Phe 
305 310 315 320 

Thr Asp Thr Leu Phe Asp Tyr Leu Asn Asn Val Val Ser Asp Gly Lys 
325 330 335 

Leu Gly Asp Gly Tyr Gly Val Gly Ser Lys Tyr Trp Asn Ser His Ala 
340 345 350 

Lys Tyr Lys Leu Ser His lie Glu Val Asp Ala Asn Asn Glu Pro Met 
355 360 365 

His Ser Asp Lys Gin Thr lie Ser Trp Lys Ser Met Ser Lys Leu Ser 
370 375 380 

Glu Pro Leu lie lie Asp Asp Val Met lie Leu Pro lie Thr Ser Phe 
385 390 395 400 

Ser Pro Gly Val Gly Gin Met Gly Ser His Ser Pro Asp His Pro Leu 
405 410 415 

Ala Phe Val Arg His Met Phe Gin Gly Ser Trp Lys Pro Asp Ala Glu 
420 425 430 



Lys Met 
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<210> 44 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer D03 used for confirmation of destruction 
of Ogataea minuta OCH1 gene 

<400> 44 

ccattgtcag ctccaattct ttgataaacg 



<210> 45 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer DOS used for confirmation of destruction 
of Ogataea minuta OCH1 gene 

<400> 45 

acacttccgt aagttccaag agacatggcc 



<210> 46 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer D03-2 used for confirmation of destruction 
of Ogataea minuta OCH1 gene 

<400> 46 

tcaccacgtt attgagataa tcaaacaggg 



<210> 47 

<211> 8 

<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 47 

Thr Asn Tyr Leu Asn Ala Gin Tyr 
1 5 
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<210> 48 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 48 

Lys Ala Tyr Trp Glu Val Lys Phe 
1 5 



<210> 49 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PPA5 for amplification of 5 1 -region of 
Ogataea minuta PEP4 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 

<400> 49 

acnaaytayy tnaaygcnca rta 



<210> 50 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PPA3 for amplification of 3 1 -region of 
Ogataea minuta PEP4 gene 

<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 
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<400> 50 

aayttnacyt cccartangc ytt 
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<210> 51 

<211> 1951 

<212> DNA 

<213> Ogataea minuta 
<220> 

<221> modif ied_base 

<222> (35) 

<223> a, c, g or t 



<400> 51 

catatgtatt 

tgacccgcca 

gctaatcacg 

tttcgataca 

acccccattc 

ctctgaatct 

ttcattgaac 

gtttttttcc 

aactctcgct 

atgcgccaat 

acgtggattc 

cccaaaacgc 

agggacacaa 

ttggtacccc 

ttccaggctc 

cttcgtcaac 

tggagggttt 

actttgccga 

ttctcggact 

ctttgaacct 

agtctgaggc 

gaaagatcac 

tcgctcttgg 

cctctttaat 

agaagtcctg 

tcactttcaa 

tttcaggctc 

tcgccatcat 

acgctgttgg 

ttaattctgc 

tggtatgttt 

tgcactactt 

tgaattgcac 



catcaatcta 
cgaccttgct 
aagatttttg 
tttgttatcc 
tagtggcaaa 
tccttctgat 
ttgcgaagcg 
ccacaaaaag 
tgcattgctc 
caagaagact 
tctgaagggc 
ctttgtccct 
ctcccctctc 
gggccaaccg 
ggattgttct 
ctacaaggcc 
tgtctcccag 
ggccaccagt 
tgcgtacgac 
ggggcttttg 
agacggtgga 
ttggttgcca 
tgacgagtac 
tgctttgcct 
gtccggccag 
ctttgacggt 
gtgcatttcc 
tggtgatgct 
attggctaag 
catcctggat 
attgctttta 
cactctttca 
aaatttaaag 



cagcttttct 
ggcttccagc 
gatttcctga 
gaattgggaa 
caggattgaa 
atcaatcaac 
ctcacacagc 
gctcacgctg 
gcccttggtg 
cctgccgcgg 
aagtatgtct 
tttgttcagg 
acgaactacg 
ttcaaggtca 
tctcttgctt 
aacggctctg 
gacaccctga 
gagccaggtc 
accatctcgg 
gacgagcctc 
gtggctactt 
gtgagaagaa 
gcgactttag 
tcccaattgg 
tacaccattg 
tacaatttct 
gccttcactc 
ttcctgagaa 
gccgtttaat 
tggcatgaat 
ttacgtgacc 
ttggctttgg 
attgctctag 



aatcngtgtg 
gcgcgaaact 
tctgtagtgt 
tggcattagt 
agagggctaa 
aattgttaaa 
atcgttcggt 
cctcctcact 
gtttccaaga 
aaacttacaa 
ctatgtttgc 
aagtgcaaga 
tgaacgctca 
tcctcgacac 
gctacctgca 
aatttgctat 
ctcttggtga 
tcgcatttgc 
tggacaagat 
agttcgcctt 
tcggaggtgt 
aggcttactg 
acggatatgg 
ctgagatttt 
actgtgaaaa 
ctatctccgc 
cgatggactt 
agtattactc 
ctctagcctt 
ggttggttgg 
aaatgttggt 
aagtacgtta 
a 



acttcagtca 
cactcccaat 
atccatcctg 
cgtgcgccac 
aaggtaactt 
cgattgaaag 
tagcagttac 
cttgcctctt 
ggcccacgcc 
ggacgtgagt 
taagcatgct 
cccagagttt 
gtacttcact 
tggttcgtcc 
tcagaagtac 
cagatacggc 
cctcatcatt 
ctttggtaag 
tgttcctcct 
ctacctcgga 
tgacgaaact 
ggaggtgaag 
cgctgccatc 
gaactctcaa 
gagagcatct 
gtacgactac 
ccctgcccca 
cgtgtacgac 
ctagttattg 
tacgcatata 
ttttctttca 
tttttttcac 



catgatcctc 
tttcggatta 
ccttaatcgt 
ccgactcgcc 
agtgttttat 
ttttgaaaca 
aacagtttag 
ttcttgatga 
aaggttcatc 
ttcggcgact 
gcggagtcct 
actgttcagg 
gagattcaaa 
aatttgtggg 
gaccacgact 
tctggttcgc 
ccaaagcaag 
tttgacggta 
atctacaacg 
gacactgcca 
aagtacgacg 
tttgacggta 
gacacaggta 
atcggtgccg 
ttgccagacc 
actcttgagg 
attggccctc 
ttgggcaagg 
attgctattg 
cggttggcgg 
ccttttactc 
cctatgtaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1951 



<210> 52 
<211> 410 
<212> PRT 

<213> Ogataea minuta 
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<400> 52 

Met Lys Leu Ser Leu Ala Leu Leu Ala Leu Gly Gly Phe Gin Glu Ala 
15 10 15 

His Ala Lys Val His His Ala Pro lie Lys Lys Thr Pro Ala Ala Glu 
20 25 30 

Thr Tyr Lys Asp Val Ser Phe Gly Asp Tyr Val Asp Ser Leu Lys Gly 
35 40 45 

Lys Tyr Val Ser Met Phe Ala Lys His Ala Ala Glu Ser Ser Gin Asn 
50 55 60 

Ala Phe Val Pro Phe Val Gin Glu Val Gin Asp Pro Glu Phe Thr Val 
65 70 75 80 

Gin Glu Gly His Asn Ser Pro Leu Thr Asn Tyr Val Asn Ala Gin Tyr 
85 90 95 

Phe Thr Glu He Gin He Gly Thr Pro Gly Gin Pro Phe Lys Val He 
100 105 110 

Leu Asp Thr Gly Ser Ser Asn Leu Trp Val Pro Gly Ser Asp Cys Ser 
115 120 125 

Ser Leu Ala Cys Tyr Leu His Gin Lys Tyr Asp His Asp Ser Ser Ser 
130 135 140 

Thr Tyr Lys Ala Asn Gly Ser Glu Phe Ala He Arg Tyr Gly Ser Gly 
145 150 155 160 

Ser Leu Glu Gly Phe Val Ser Gin Asp Thr Leu Thr Leu Gly Asp Leu 
165 170 175 

He He Pro Lys Gin Asp Phe Ala Glu Ala Thr Ser Glu Pro Gly Leu 
180 185 190 

Ala Phe Ala Phe Gly Lys Phe Asp Gly He Leu Gly Leu Ala Tyr Asp 
195 200 205 

Thr He Ser Val Asp Lys He Val Pro Pro He Tyr Asn Ala Leu Asn 
210 215 220 

Leu Gly Leu Leu Asp Glu Pro Gin Phe Ala Phe Tyr Leu Gly Asp Thr 
225 230 235 240 

Ala Lys Ser Glu Ala Asp Gly Gly Val Ala Thr Phe Gly Gly Val Asp 
245 250 255 

Glu Thr Lys Tyr Asp Gly Lys He Thr Trp Leu Pro Val Arg Arg Lys 
260 265 270 

Ala Tyr Trp Glu Val Lys Phe Asp Gly He Ala Leu Gly Asp Glu Tyr 
275 280 285 

Ala Thr Leu Asp Gly Tyr Gly Ala Ala He Asp Thr Gly Thr Ser Leu 
290 295 300 
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lie Ala Leu Pro Ser Gin Leu Ala Glu lie Leu Asn Ser Gin He Gly 
305 310 315 320 

Ala Glu Lys Ser Trp Ser Gly Gin Tyr Thr He Asp Cys Glu Lys Arg 
325 330 335 

Ala Ser Leu Pro Asp Leu Thr Phe Asn Phe Asp Gly Tyr Asn Phe Ser 
340 345 350 

He Ser Ala Tyr Asp Tyr Thr Leu Glu Val Ser Gly Ser Cys He Ser 
355 360 365 

Ala Phe Thr Pro Met Asp Phe Pro Ala Pro He Gly Pro Leu Ala He 
370 375 380 

He Gly Asp Ala Phe Leu Arg Lys Tyr Tyr Ser Val Tyr Asp Leu Gly 
385 390 395 400 

Lys Asp Ala Val Gly Leu Ala Lys Ala Val 
405 410 



<210> 53 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> M0D_RES 

<222> (2) 

<223> Gly or Leu 

<400> 53 

Asp Xaa Asn Gly His Gly Thr His Cys Ala Gly 
15 10 



<210> 54 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> M0D_RES 

<222> (6) 

<223> Ser or Thr 

<220> 

<221> M0D_RES 

<222> (9) 

<223> Val or He 

<220> 

<221> MOD_RES 

<222> (10) 

<223> Ala or Val 
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<400> 54 

Gly Thr Ser Met Ala Xaa Pro His Xaa Xaa Gly 
15 10 

<210> 55 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PPB5 for amplification of 5 1 -region of 
Ogataea minuta PRB1 gene 

<220> 

<221> modif ied__base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c # g or t 

<220> 

<221> modif ied_base 

<222> (30) 

<223> a, c, g or t 

<400> 55 

gaybknaayg gncayggnac ncaytgykcn gg 3 2 



<210> 56 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PPB3 for amplification of 3 1 -region of 
Ogataea minuta PRB1 gene 

<220> t 
<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
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<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (24) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a # c, g or t 

<220> 

<221> modif ied_base 

<222> (30) 

<223> a, c, g or t 



<400> 56 

ccnrcnayrt gnggnwsngc catnwsngtn cc 
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<210> 57 
<211> 2214 
<212> DNA 

<213> Ogataea minuta 



<400> 57 

ggatcccctc 

agctttcacc 

cagatcccaa 

aagagggtgt 

ttgccataga 

tttttcagaa 

ttatcttatc 

gctattctgt 

ccagcagagt 

gtcgactcgg 

gtcacatcgc 

aatggcctag 



tctcgctagc 
ccaacacgtc 
atttgccacc 
tttgacgttt 
ttatcttgcg 
cagaaaaatc 
tcacctcaac 
cttcgttggc 
ttgcccttgt 
ccattagtga 
tgatcgcatc 
cttccggggc 



gagtttcgcc 
actttctgca 
agactaaatt 
catatacatt 
cagttcagca 
gccaatcaca 
cacttgaacc 
agcagtggag 
tccaatggat 
tgctcctatc 
tcaaaatttg 
agttgacttc 



tgctctgcga 
gtcgtgcgcc 
ggggcattct 
gctctttctt 
tccttaggag 
gaaagattca 
aaaatgaagt 
gccttggtca 
gcgaaagcgg 
gcggcactaa 
attccaaact 
cacatggagt 



taagagaaaa 
ggcttgcatt 
ggtgagggaa 
ttcttggacg 
ttattctttc 
gtcctaattg 
tatcccagtc 
tcccgttatt 
aagtcatttc 
atgatgctcc 
cttatattgt 
ggctcaagga 



ccggctgtgc 
aggtcgtgcg 
taggggaaat 
gttagcggta 
ttgtaggtct 
aagccttatc 
tgctgcggtg 
tgacgacttg 
tgacgttcct 
aagccctctc 
cgttttcaag 
aacgcactcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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caaaccctgg 
gtttccgaaa 
accggatact 
gttgagcagg 
ggtttgtcta 
tacgacgatc 
aagcacgagc 
gatattgacg 
ggagtttcta 
tccatgtctg 
aaagccaaga 
tctcctgctc 
gccgctggta 
gtcactgttg 
tgtgttgaca 
gccaccgcca 
tactttttga 
gctcagctga 
tcggacggaa 
tgggcgtttg 
acaagtgttg 
gtcgccgaag 
ttactgtggt 
cgcttttcct 
tttcaagttg 



ctgctttgtc 
gcattgatct 
tcccggagaa 
actcgagagt 
gaatctctca 
ttgctggaga 
agttcggtgg 
gaaacggtca 
agaactccaa 
acgtgatcaa 
aggacggttt 
ttgacttggc 
acgacaatgc 
gtgcgtccac 
tttttgctcc 
ctctttctgg 
gcttgcaacc 
agaagaacct 
ccccaaatat 
aagacgaggc 
acttagacct 
aggttgctga 
ttcggaaact 
tttttttctg 
cttccaatat 



taaggacatg 
tactgaggtg 
ggtggttgac 
tttcgccgat 
cagagagcct 
tggcgtcacg 
cagagcagag 
cggtactcac 
aattgtcgca 
gggtgttgaa 
caagggatcg 
tgtcaatgcc 
tgacgcatgc 
tttgtctgac 
gggcttgaac 
tacatcgatg 
agaatcgtcg 
gatcaagttt 
tctcatttac 
ctcggccaag 
gcaagatatc 
tttgttcgat 
gggtttagac 
ttgtttggag 
tgtttgtcca 



ccagcagaag 
tttagcatct 
ctcatcagaa 
aagtcgtcta 
ctcagtctcg 
gcttatgtca 
tggggtaaga 
tgcgctggta 
gtgaaggttt 
ttcgctgcaa 
actgccaaca 
gctgtcaaag 
aactattctc 
tctagagctt 
atcctttcca 
gcctcccctc 
tcgttgtttt 
ggtacgaaga 
aacggtgctg 
tccgacttga 
aaggagaagt 
taggtttcta 
agcggtcctg 
tgtttgtttt 
gattagagtc 



aattggccgc 
ccgatttgtt 
gacaccctga 
ctcaaaacgg 
gcaatttcaa 
ttgataccgg 
ccatcccaac 
caattggctc 
tgagatctaa 
atgatcacgt 
tgtctttggg 
ctggtttaca 
ctgctgctgc 
acttttccaa 
cctacatagg 
acgtttgtgg 
cttcggcagc 
acgttttgtc 
gcaagaacat 
agaaggctgt 
tcaaccatat 
acaattcagt 
ttactcatat 
tctggataat 
attgcttgaa 



cgaaggtttc 
cagtggatat 
cgtggcgttc 
tgctccttgg 
cgagtacgtt 
tatcaatgtg 
cggtgatgat 
ggaagattat 
cggttctggt 
tgccaagtct 
aggtggcaag 
ctttgctgtt 
agagaacgca 
ctatggtaaa 
ttctgacact 
tctgttgacc 
tatctcccct 
tgagattcca 
cagtgacttc 
cgatattgcc 
tttggaggag 
gatcttgtct 
tgcgcttgat 
gtggttagtt 
gctt 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2214 



<210> 58 

<211> 539 

<212> PRT 

<213> Ogataea minuta 



<400> 58 

Met Lys Leu Ser Gin Ser Ala Ala Val Ala He Leu Ser Ser Leu Ala 
15 10 15 

Ala Val Glu Ala Leu Val He Pro Leu Phe Asp Asp Leu Pro Ala Glu 
20 25 30 



Phe Ala Leu Val Pro Met Asp Ala 
35 40 

Pro Val Asp Ser Ala He Ser Asp 
50 55 

Ala Pro Ser Pro Leu Val Thr Ser 
65 70 

Pro Asn Ser Tyr He Val Val Phe 
85 

Val Asp Phe His Met Glu Trp Leu 
100 



Lys Ala Glu Val He Ser Asp Val 
45 

Ala Pro He Ala Ala Leu Asn Asp 
60 

Leu He Ala Ser Gin Asn Leu He 

75 80 

Lys Asn Gly Leu Ala Ser Gly Ala 
90 95 

Lys Glu Thr His Ser Gin Thr Leu 
105 110 



Ala Ala Leu Ser Lys Asp Met Pro Ala Glu Glu Leu Ala Ala Glu Gly 
115 120 125 



Phe Val Ser Glu Ser lie Asp Leu Thr Glu Val Phe Ser lie Ser Asp 
130 135 140 

Leu Phe Ser Gly Tyr Thr Gly Tyr Phe Pro Glu Lys Val Val Asp Leu 
145 150 155 160 

lie Arg Arg His Pro Asp Val Ala Phe Val Glu Gin Asp Ser Arg Val 
165 170 175 

Phe Ala Asp Lys Ser Ser Thr Gin Asn Gly Ala Pro Trp Gly Leu Ser 
180 185 190 

Arg lie Ser His Arg Glu Pro Leu Ser Leu Gly Asn Phe Asn Glu Tyr 
195 200 205 

Val Tyr Asp Asp Leu Ala Gly Asp Gly Val Thr Ala Tyr Val He Asp 
210 215 220 

Thr Gly He Asn Val Lys His Glu Gin Phe Gly Gly Arg Ala Glu Trp 
225 230 235 240 

Gly Lys Thr He Pro Thr Gly Asp Asp Asp He Asp Gly Asn Gly His 
245 250 255 

Gly Thr His Cys Ala Gly Thr He Gly Ser Glu Asp Tyr Gly Val Ser 
260 265 270 

Lys Asn Ser Lys He Val Ala Val Lys Val Leu Arg Ser Asn Gly Ser 
275 280 285 

Gly Ser Met Ser Asp Val He Lys Gly Val Glu Phe Ala Ala Asn Asp 
290 295 300 

His Val Ala Lys Ser Lys Ala Lys Lys Asp Gly Phe Lys Gly Ser Thr 
305 310 315 320 

Ala Asn Met Ser Leu Gly Gly Gly Lys Ser Pro Ala Leu Asp Leu Ala 
325 330 335 

Val Asn Ala Ala Val Lys Ala Gly Leu His Phe Ala Val Ala Ala Gly 
340 345 350 

Asn Asp Asn Ala Asp Ala Cys Asn Tyr Ser Pro Ala Ala Ala Glu Asn 
355 360 365 

Ala Val Thr Val Gly Ala Ser Thr Leu Ser Asp Ser Arg Ala Tyr Phe 
370 375 380 

Ser Asn Tyr Gly Lys Cys Val Asp He Phe Ala Pro Gly Leu Asn He 
385 390 395 400 

Leu Ser Thr Tyr He Gly Ser Asp Thr Ala Thr Ala Thr Leu Ser Gly 
405 410 415 

Thr Ser Met Ala Ser Pro His Val Cys Gly Leu Leu Thr Tyr Phe Leu 
420 425 430 
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Ser Leu Gin Pro 
435 

Pro Ala Gin Leu 
450 

Leu Ser Glu lie 
465 

Gly Ala Gly Lys 



Ser Ala Lys Ser 
500 

Asp Leu Asp Leu 
515 

Glu Val Ala Glu 
530 



Glu Ser Ser Ser 
440 

Lys Lys Asn Leu 
455 

Pro Ser Asp Gly 
470 

Asn lie Ser Asp 
485 

Asp Leu Lys Lys 



Gin Asp lie Lys 
520 

Glu Val Ala Asp 
535 



Leu Phe Ser Ser 



lie Lys Phe Gly 
460 

Thr Pro Asn lie 
475 

Phe Trp Ala Phe 
490 

Ala Val Asp lie 
505 

Glu Lys Phe Asn 



Leu Phe Asp 



Ala Ala He Ser 
445 

Thr Lys Asn Val 



Leu He Tyr Asn 
480 

Glu Asp Glu Ala 
495 

Ala Thr Ser Val 
510 

His He Leu Glu 
525 



<210> 59 
<211> 9 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> M0D_RES 

<222> (1) 

<223> His or Asn 



<220> 

<221> MOD_RES 

<222> (5) \ 
<223> Val or Thr 



<400> 59 

Xaa Tyr Asp Trp Xaa Phe Leu Asn Asp 
1 5 



<210> 60 

<211> 12 

<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 60 

Tyr Asn Leu Cys His Phe Trp Ser Asn Phe Glu He 
15 10 



<210> 61 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PKR5 for amplification of 5 ■ -region of 
Ogataea minuta KTR1 gene 

<220> 

<2 21> modif ied__base 

<222> (15) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 

<400> 61 

maytaygayt ggrynttyyt naayga 26 



<210> 62 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PKR3 for amplification of 3 1 -region of 
Ogataea minuta KTR1 gene 

<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 

<400> 62 

atytcraart tnswccaraa rtgrcanarr ttrta 35 



<210> 63 
<211> 1930 
<212> DNA 

<213> Ogataea minuta 



<400> 63 

gagctctata 

agtgagttct 

acaatggcta 

ttatgtggat 

tccggctcga 

gagaaaagcg 

aaggtgaagg 

tcgatcagac 

aacgacgcgg 



tttagctttg 
atagaaagac 
gagcgaatgc 
acattttatc 
gttccagtgg 
ttgcaggcgc 
ccacttttgt 
aggtcgaaga 
aattcaacga 



gacattggta 
aggctacaaa 
gaggctgatc 
caagggctcg 
cactgttgct 
agccgagaaa 
ctctttggcc 
ccgtttcaac 
cgagttcaag 



ctagttggac 
gaccaccaag 
cggtttgcaa 
tctacttcgt 
aatactgaga 
agcgttcctg 
agaaaccagg 
aacaagtatc 
aaggtgacct 



tgttgatcgg 
gctggcaaat 
tctttgctac 
atacgatttc 
aatctgccct 
cagctgacgt 
atctgtggga 
attacgattg 
ctcaggtctg 



ttgacttgac 60 
ttgcgagatt 120 
cgtgttggtt 180 
gacgccagag 240 
cgcagtgggt 3 00 
cccagatgga 3 60 
gctggtgaac 420 
ggtgttcttg 480 
ttcgggtaag 540 
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accaagtatg 
aaggctgctg 
tcgtacagac 
gagtacgagt 
gacatcttca 
gagtacaagg 
ccacagtacg 
tacaatggat 
gaagcctaca 
tggggagatg 
catttcttcg 
gctgtcagag 
tactcctgta 
cagtacatcc 
tacaagtctt 

ggggtttctg 

tgtctcccgg 
acgcccgcag 
gatgagcttc 
caaggacttc 
cactgagtcc 
ccaattggtt 
tgaatcttcg 
gtttgagctc 



gtgtcattcc 
ccaccagaga 
acatgtgcag 
actactggag 
agttcatgaa 
agaccatccc 
ttgcccagga 
gtcacttctg 
ccaagtactt 
cccctatcca 
acgacgtcgg 
aggccaagaa 
cccctaagtt 
attagttgac 
tcaatactaa 
taatttttgt 
gggcaaacaa 
gtaaacgcaa 
gagaacgttc 
ctctccttct 
gagcgacttg 
tacgaggtag 
cccagtgagg 



aaaggaacag 
gcaaatgaga 
atacgagtcg 
agtggagcca 
ggacaacaag 
aactctgtgg 
caacctgatc 
gtctaact tc 
tgaggcgttg 
ctcgattgcc 
atacttccac 
ctgtgtctgc 
ttaccaggag 
ccaggccacg 
attagctgga 
ttcgagaaaa 
cgtgctctct 
tgtccgatac 
tcgaggttct 
cgacgatcat 
agctgttgtc 
gatgggactt 
ggctgatgaa 



tggagcttcc 
aaggacaaga 
ggattcttct 
agcatcaaga 
aagtcgtacg 
aagaccacta 
aactttattt 
gaggtcggct 
gaccaggctg 
gttgctctgt 
aatccgttcc 
aaccaagccg 
atgggtttga 
ggttgatttc 
ttcaagtgat 
ataaatctac 
ctgctactaa 
agattctgag 
agaagactct 
cgacgtccat 
gaaactgaca 
accaccgatc 
cagcaaggtc 



cttcgtggat 
tcatctacgg 
tcaaacaccc 
tctactgtga 
gatggaccat 
gagacttcat 
cggacgacgg 
cgctcgagtt 
gtgggttctt 
tcatgcctaa 
acaactgccc 
acgacttcac 
aaaagcctgc 
gcctggttgt 
acgagatgat 
aaaaaaacgt 
gtgttttgtt 
tcagtctcga 
gcgtctgagt 
ctgggtgatc 
tctattctga 
atattcagct 
acggttcttt 



cgacactgat 
agactccatc 
agaactcgca 
cattgactac 
ttctcttcct 
gaaggaaaac 
aggaagcagc 
ctggagaggc 
ctacgaaaga 
ggacgaggtt 
gatcgacaac 
cttccagcac 
taactgggag 
tttttgtttt 
tttcatctcc 
gccagatact 
tctgtccaca 
cgatcacaca 
gctccaagaa 
tttccattta 
gcaatgacca 
tcctggactc 
tcttgaagct 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1930 



<210> 64 
<211> 403 
<212> PRT 

<213> Ogataea minuta 



<400> 64 
Met Ala Arg Ala 
1 

Val Leu Val Leu 
20 

Tyr Thr lie Ser 
35 

Ala Asn Thr Glu 
50 

Gly Ala Ala Glu 
65 

Val Lys Ala Thr 



Leu Val Asn Ser 
100 

His Tyr Asp Trp 
115 

Lys Lys Val Thr 
130 



Asn Ala Arg Leu 
5 

Cys Gly Tyr lie 



Thr Pro Glu Ser 
40 

Lys Ser Ala Leu 
55 

Lys Ser Val Pro 
70 

Phe Val Ser Leu 
85 

He Arg Gin Val 



Val Phe Leu Asn 
120 

Ser Gin Val Cys 
135 



He Arg Phe Ala 
10 

Leu Ser Lys Gly 
25 

Gly Ser Ser Ser 



Ala Val Gly Glu 
60 

Ala Ala Asp Val 
75 

Ala Arg Asn Gin 
90 

Glu Asp Arg Phe 
105 

Asp Ala Glu Phe 



Ser Gly Lys Thr 
140 



He Phe Ala Thr 
15 

Ser Ser Thr Ser 
30 

Ser Gly Thr Val 
45 

Lys Ser Val Ala 



Pro Asp Gly Lys 
80 

Asp Leu Trp Glu 
95 

Asn Asn Lys Tyr 
110 

Asn Asp Glu Phe 
125 

Lys Tyr Gly Val 
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lie Pro Lys Glu Gin Trp Ser Phe Pro Ser Trp lie Asp Thr Asp Lys 
145 150 155 160 

Ala Ala Ala Thr Arg Glu Gin Met Arg Lys Asp Lys lie lie Tyr Gly 
165 170 175 

Asp Ser lie Ser Tyr Arg His Met Cys Arg Tyr Glu Ser Gly Phe Phe 
180 185 190 

Phe Lys His Pro Glu Leu Ala Glu Tyr Glu Tyr Tyr Trp Arg Val Glu 
195 200 205 

Pro Ser lie Lys lie Tyr Cys Asp lie Asp Tyr Asp lie Phe Lys Phe 
210 215 220 

Met Lys Asp Asn Lys Lys Ser Tyr Gly Trp Thr lie Ser Leu Pro Glu 
225 230 235 240 

Tyr Lys Glu Thr lie Pro Thr Leu Trp Lys Thr Thr Arg Asp Phe Met 
245 250 255 

Lys Glu Asn Pro Gin Tyr Val Ala Gin Asp Asn Leu lie Asn Phe lie 
260 265 270 

Ser Asp Asp Gly Gly Ser Ser Tyr Asn Gly Cys His Phe Trp Ser Asn 
275 280 285 

Phe Glu Val Gly Ser Leu Glu Phe Trp Arg Gly Glu Ala Tyr Thr Lys 
290 295 300 

Tyr Phe Glu Ala Leu Asp Gin Ala Gly Gly Phe Phe Tyr Glu Arg Trp 
305 310 315 320 

Gly Asp Ala Pro lie His Ser lie Ala Val Ala Leu Phe Met Pro Lys 
325 330 335 

Asp Glu Val His Phe Phe Asp Asp Val Gly Tyr Phe His Asn Pro Phe 
340 345 350 

His Asn Cys Pro lie Asp Asn Ala Val Arg Glu Ala Lys Asn Cys Val 
355 360 365 

Cys Asn Gin Ala Asp Asp Phe Thr Phe Gin His Tyr Ser Cys Thr Pro 
370 375 380 

Lys Phe Tyr Gin Glu Met Gly Leu Lys Lys Pro Ala Asn Trp Glu Gin 
385 390 395 400 

Tyr lie His 



<210> 65 
<211> 10 
<212> PRT 

<213> Saccharomyces cerevisiae 
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<400> 65 

Thr Ser Trp Val Leu Trp Leu Asp Ala Asp 
15 10 



<210> 66 
<211> 10 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 66 

Glu Thr Glu Gly Phe Ala Lys Met Ala Lys 
15 10 



<210> 67 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PMN5 for amplification of 5 * -region of 
Ogataea minuta MNN9 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (27) 

<223> a, c, g or t 



<400> 67 

acnwsntggg tnytntggyt ngaygcnga 



29 
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<210> 68 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PMN3 for amplification of 3 1 -region of 
Ogataea minuta MNN9 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (24) 

<223> a, c, g or t 

<400> 68 

ttngccatyt tngcraancc ytcngtytc 2 9 



<210> 69 

<211> 2221 

<212> DNA 

<213> Ogataea minuta 



<400> 69 

gggcccagag 

ctgccccgca 

ccggcctttc 

agcccctgtt 

atctttcaga 

ggctagtcag 

gcatcgcagg 

taaccgtagt 

ttacccaaaa 

ggcacactgg 

ggtccatccc 

cgcctcgaaa 

aagacgcaaa 

agaaacacaa 

catctctttc 

attggctaaa 

gtacaccaga 



gaggaggtag 
cttcggccgc 
cattgaagcg 
tgccccgaaa 
tggccgcagg 
tagggcaccc 
agtgttaagc 
cggcgcgcga 
gaagcaggac 
acggtcatcc 
ctagcaaact 
tccccttgaa 
acgccttact 
gagctttacc 
agcaccatta 
taatcagtgc 
tacgaaggaa 



gagtgggggg 
accgccaacc 
agctaagcag 
ccgggttaga 
ttgctggcag 
tgcgggttag 
ggcagaaaag 
gtccgagaag 
tggaaggaaa 
tagtgtggtt 
cgagccggat 
gtacacagcc 
t tgaacaaca 
aacctctaaa 
tagaactccg 
tgatttggac 
acccgtgaag 



atcgccaatt 
ttttctttgc 
tcagagagca 
caaaacgcgg 
cttcgggctg 
taagtctccc 
aggaggtgga 
ttaatcgacg 
cagaccggga 
ccgctcaccg 
caccctattc 
tgaaatttgg 
tcaacatcta 
aaataaccta 
gaaagcatat 
atgttgaaag 
gtgttggttc 



ttctggcgcg 
gcaccccctc 
ccaccgggag 
ttcggcttca 
acaacatcat 
tgcaggtcac 
gtggggacga 
cgtcgaaact 
ttggtgtgta 
cttacctggc 
tggccggttt 
ctttttcttc 
gcaaatgctg 
ggctcccgtt 
tcacagcacg 
gcgttttgaa 
ccgtcttcgg 



ctactccgcg 
tccaacgttt 
acgtgatcgc 
acggactctc 
ggctgacgta 
cgttgcttga 
gagatccggg 
gggtcttttg 
tttctgtcag 
tggtgttcct 
tgctatttcc 
actgtcgtgc 
acgaaatttg 
tgcagctccg 
tgagacgcgg 
acaccctctg 
attggctgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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ttgttgtttc 
tacaagcgca 
cactatgacc 
ttgattttga 
acatatccac 
caagcattga 
gaacctaagt 
tcggaaaagg 
gccagaaact 
gactcagata 
ccagtcattg 
atccgtcctt 
acgatgtcgg 
ctgatgggcc 
gtcggaggaa 
aacttcccct 
ggctaccagg 
gtcttttata 
atgtgtcgcg 
caactagaat 
t 



tggtgtttgg 
acaacagaga 
tgaacaacct 
cgccaatggc 
gtgacctgat 
agaagctgga 
tcgccaaagt 
acagacacgc 
cgctgctgtt 
tcgtggagtc 
ctgccaattg 
acgacttcaa 
acgacgagat 
atttctatga 
cctgtctgat 
tctaccatct 
tgtttggtct 
tagttgagca 
acgcgacgcc 
aggttcgcac 



aggctcgtct 
tgaggtgatt 
tgcgtcgacg 
gaagtttctg 

c 9 a 9 ctc 99 a 
gcacgcggtg 
cacgatcctc 
gttcaaggtg 
caccaccatt 
gcctcacacg 
ctaccagaga 
caactggatc 
catcgtggaa 
tcctaacggc 
ggtgaaggcc 
catcgaaacc 
tccaaactat 
aaaatgaaaa 
gtagttgttg 
acgaccccac 



tccaacagaa 
ccacgtaatt 
ccgatggctg 
gacggatact 
ttcattgtgc 
aagattatcc 
agacaggaca 
cagaaagaac 
ggcccgtaca 
ttgatccagg 
tactacgacg 
gagtctgaag 
gcgtacgcag 
gacctgggaa 
gacgtccatc 
gaagggttcg 
cttgttttcc 
aacatgtcaa 
caccgcaacg 
cgttccgatt 



agaccaacag 
tgccagcgga 
cttacaacaa 
gggacaactt 
cgcgcacagc 
agaacccaaa 
acgagtccct 
ggcgcgcaca 
cctcatgggt 
atcttgtttc 
aggacaagaa 
agggactacg 
aaattgccac 
ccgagatgca 
gcgacggggc 
ccaaaatggc 
actacaacga 
aaatagcaag 
cgaacttctg 
tccttatcag 



tccctactcg 
tcacatctcc 
ggagagagtg 
gctgaaattg 
agagggagac 
gaacaccaag 
ttcgtcacag 
aatggccaca 
tctgtggctt 
gcacgacaag 
ggaggactcc 
gatcgcatcc 
ctatcgtcca 
a ctggatggt 
catgttcccg 
caaacggctt 
gtgactcttg 
acaacgtgaa 
tcgcgcctgt 
caaagagatc 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2221 



<210> 70 
<211> 367 
<212> PRT 

<213> Ogataea minuta 
<400> 70 

Met Leu Lys Gly Val Leu Lys His Pro Leu Val His Gin lie Arg Arg 
15 10 15 

Lys Pro Val Lys Val Leu Val Pro Val Phe Gly Leu Ala Val Leu Leu 
20 25 30 

Phe Leu Val Phe Gly Gly Ser Ser Ser Asn Arg Lys Thr Asn Ser Pro 
35 40 45 

Tyr Ser Tyr Lys Arg Asn Asn Arg Asp Glu Val lie Pro Arg Asn Leu 
50 55 60 

Pro Ala Asp His lie Ser His Tyr Asp Leu Asn Asn Leu Ala Ser Thr 
65 70 75 80 

Pro Met Ala Ala Tyr Asn Lys Glu Arg Val Leu lie Leu Thr Pro Met 
85 90 95 

Ala Lys Phe Leu Asp Gly Tyr Trp Asp Asn Leu Leu Lys Leu Thr Tyr 
100 105 110 

Pro Arg Asp Leu lie Glu Leu Gly Phe lie Val Pro Arg Thr Ala Glu 
115 120 125 

Gly Asp Gin Ala Leu Lys Lys Leu Glu His Ala Val Lys lie lie Gin 
130 135 140 



Asn Pro Lys Asn Thr Lys Glu Pro Lys Phe Ala Lys Val Thr lie Leu 
145 150 155 160 
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Arg Gin Asp Asn Glu Ser Leu Ser Ser Gin Ser Glu Lys Asp Arg His 
165 170 175 

Ala Phe Lys Val Gin Lys Glu Arg Arg Ala Gin Met Ala Thr Ala Arg 
180 185 190 

Asn Ser Leu Leu Phe Thr Thr lie Gly Pro Tyr Thr Ser Trp Val Leu 
195 200 205 

Trp Leu Asp Ser Asp lie Val Glu Ser Pro His Thr Leu lie Gin Asp 
210 215 220 

Leu Val Ser His Asp Lys Pro Val lie Ala Ala Asn Cys Tyr Gin Arg 
225 230 235 240 

Tyr Tyr Asp Glu Asp Lys Lys Glu Asp Ser lie Arg Pro Tyr Asp Phe 
245 250 255 

Asn Asn Trp lie Glu Ser Glu Glu Gly Leu Arg lie Ala Ser Thr Met 
260 265 270 

Ser Asp Asp Glu lie lie Val Glu Ala Tyr Ala Glu lie Ala Thr Tyr 
275 280 285 

Arg Pro Leu Met Gly His Phe Tyr Asp Pro Asn Gly Asp Leu Gly Thr 
290 295 300 

Glu Met Gin Leu Asp Gly Val Gly Gly Thr Cys Leu Met Val Lys Ala 
305 310 315 320 

Asp Val His Arg Asp Gly Ala Met Phe Pro Asn Phe Pro Phe Tyr His 
325 330 335 

Leu lie Glu Thr Glu Gly Phe Ala Lys Met Ala Lys Arg Leu Gly Tyr 
340 345 350 

Gin Val Phe Gly Leu Pro Asn Tyr Leu Val Phe His Tyr Asn Glu 
355 360 365 



<210> 71 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DMN5 

<400> 71 

agatgaggtg attccacgta atttgccagc 



<210> 72 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DMN3 

<400> 72 

ttttgattgt catctatttc gcacaccctg 30 



<210> 73 
<211> 12 
<212> PRT 

<213> Pichia pastoris 
<400> 73 

Gly Gly Gly Ser Ser lie Asn Phe Met Met Tyr Thr 
1 5 10 



<210> 74 
<211> 10 
<212> PRT 

<213> Pichia pastoris 
<400> 74 

Asp Met Trp Pro Met Val Trp Ala Tyr Lys 
15 10 



<210> 75 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PAX 5 for amplification of 5 ' -region of 
Ogataea minuta A0X1 gene 

<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (9) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 
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<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<400> 75 

ggnggnggnw snwsnathaa yttyatgatg tayac 3 5 



<210> 76 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PAX3 for amplification of 3 1 -region of 
Ogataea minuta AOX1 gene 

<220> 

<221> modif ied_base 

<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, g or t 

<400> 76 

ttrtangccc anaccatngg ccacatrtc 29 



<210> 77 

<211> 5817 

<212> DNA 

<213> Ogataea minuta 



<400> 77 

aagctttctt 

aggcgcactt 

tttgcgatta 

ggctagcgat 

aagatccgaa 

tcaatgtagc 

caagtcgttg 

ccgacgtgca 

tggaggattt 

ttgaaagcgc 

cgcagaacca 

ccaacggcac 

gagaagatct 



tcgcaaacag 
taaaagataa 
cgtaagccac 
agaatgtatc 
gaaagggagc 
tgttcaggtc 
ctgcggttcg 
gggtgttttc 
tcaccccgta 
accatctgat 
cctcttccag 
gctcctttct 
cctgcgcaaa 



ctctttggta 
tctacatcca 
aaaatttcag 
agtgttatta 
aagaaggcca 
tttcaacagg 
actggcatag 
gggcttgatg 
ggacttgtcg 
tttcttagag 
ctgctcgcga 
gatgatgatg 
gtccagggtc 



gaggagaata 
gaaaaataaa 
gaaactcgta 
gtggctctag 
cgccagacgt 
ttcttggtct 
tcgttggcgc 
gttcggttgt 
ccaatatcga 
gaagggaagt 
acgagtttgt 
gcatgcaatg 
cggatgatgt 



gagtgcccag 
aaaataaaac 
caagatcagg 
gagtagaaaa 
tctagtaggt 
cgtctggact 
cgagggagct 
cgttgcagct 
cccagatgtc 
ggcggagacg 
agccttcttt 
aaagtttctt 
cttcctcgga 



ctgataaaga 60 
tgaaccggca 120 
ttggcgaggg 180 
caatagaata 240 
agcccaatcg 300 
ggagatccaa 360 
gaactggtcg 420 
gaggaactct 480 
cgagcctccg 54 0 
tttgtctacg 600 
ggggaccagt 66 0 
gatcaggtcc 720 
ccagtcgagc 780 
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atgttttcaa 
tagcggaata 
gtgagctttg 
gagccgtcgg 
ttgatgagat 
gtgatgtctc 
aaagtgttgg 
gagggcgcta 
ttctccaggg 
gacatttcag 
tcatggaaga 
gcggagatcc 
atccactcaa 
gttgtcattt 
ttcctacttg 
gactgcaaat 
agagttcgag 
ggaaatagcc 
cccgcgctgg 
ttcgcgtagc 
tgacagtttg 
ttagagacct 
ccgtgtgacg 
cactctgggc 
aaactgcgta 
aagaagcaga 
cgaaaaaaat 
gctgcgccat 
agggtggtga 
acatgagact 
acggcagaag 
tcctcatgta 
ccaccgacga 
acagggaggt 
caactgccca 
ttgaagacct 
tcggtagaag 
atctgttcct 
tcggtgtcag 
tcaaggccag 
aaagatctgg 
acttgcctgg 
ccaagccaga 
ccgcctttga 
ctggtgtcaa 
aagggtacca 
tttccggttt 
actttttgga 
aagctccaga 
ggtcttacaa 
cctcgcacca 
agacatgcaa 
ggactgttcc 
tcgagatctt 
tcaaggagca 
aaggtagcaa 
tgaagaacct 
actccactgc 



gcagccattt 
ggtagtttgc 
gggtggtctt 
agtagcccgc 
ggggcaagat 
cgtcaaaagt 
agagctcgag 
caagtctcct 
gcaatcgggt 
gcattcccgc 
tttccgcgta 
cattggactt 
tcagtccgtc 
gcgggagggg 
aatgctaggc 
gacgacggta 
ccggggaggg 
gtcaaaagaa 
ctttccccgg 
ccctcggttg 
tggggtaaca 
actacggagt 
acacccccgc 
atgagcagtg 
taaatagcta 
tcactcaact 
ggctattcct 
cgccggtaga 
gaacaacatc 
cgactccaag 
ggccattgtc 
caccagagcc 
gctgcttccg 
gcacggtttc 
agacttcctg 
caaggcctcg 
atcggactcg 
cattacgtcc 
gaccgttcca 
aaagcaaatt 
tatcggtgcg 
tgtcggtgag 
ggttccaacc 
ccagtggtac 
gatcagacca 
cgagtacttt 
cttcggtgac 
gtacccattt 
ctttgatcca 
gaagtcgaga 
cccactctac 
ggcatttgct 
aattgagaag 
ctccgacatt 
cactgagacc 
gatcgctcca 
caaggttgcc 
tcttctgatt 



gtctttggag 
ttcgcctccc 
cacctcgtct 
ggcagtgacg 

c 99 c 9 ac 9 a 9 
cacgagctgg 
aagttgcgcc 
tttgctgatt 
gaagaaacag 

ct 999 tcatc 
gcgtcgcttc 
gactgcatag 
tcggcggtgc 
tgtaaagcag 
tggctaattg 
aacagaatcc 
agtgacgatg 
aaacagaagg 
ccggggaggc 
cctattgagg 
a cggtgtccg 
ttcagtgtct 
atggcttttg 
gtgtgcctgg 
cttccatctc 
tgttcaaaga 
gacgaattcg 
ctcggtaacc 
aataacccat 
acggctacct 
ccctgcgcta 
tcggcctccg 
ctcatgaaga 
gacggtccga 
agagcctgcg 
cacggagctg 
gcacacgcct 
accaaggctg 
atgaagccgg 
gtggtttcgt 
gcccacaagc 
aacttccagg 
tttgacgact 
tccaacaagg 
accgacgagg 
gacaacaagc 
cacaccaaga 
tcgagaggtt 
ggtttcctga 
cagacagcca 
ccggttgact 
ggaccaaacc 
ccaacgccaa 
gactactctg 
acctggcact 
aagggtggtg 
gacctgtcga 
ggtgagaagg 



aagaactcga 
atcaccagaa 
atgagcccct 
atgccgacat 
gcgtcgaagt 
agtctgcggt 
gtgttcagaa 
gtgcggcgtc 
ccaacggaag 
tcgatgttgt 
gcttccgaat 
agaacaaacg 
tccttgagcg 
ctcagccggt 
ggtatggcac 
actcagctgg 
cagccagaaa 
ggctgcagtt 
agccggctaa 
gttacttttc 
aggccagcta 
tcggaagctg 
gcaatctcac 
tgtatctctg 
ctacttgttg 
ctcttgtgtt 
atatcatcgt 
tcgacccgga 
gggtctacct 
tctacaactc 
acattcttgg 
actacgacga 
agctcgagac 
tcaaggtctc 
agtcgcaggg 
agtactggct 
acatccaccc 
acaaggtgat 
tcgagaccaa 
gcggtacgat 
tgagacaagc 
accactactg 
ttgtcagagg 
acggtcctct 
agttggccac 
cagacaagcc 
ttccaaacgg 
tcgtttatgc 
acgattccag 
gaagaatgga 
ctccagcccg 
acttcaccgc 
agaacgactc 
ccgaggacga 
gtttgggaac 
ttgtcgatgc 
tctgcccaga 
ctgccacttt 



gtccgccaag 
cgttctggcg 
tgagtcgggc 
agaggtcttt 
tggagccgtc 
gtacggatgt 
tgagccgaat 
cgtcctcgat 
gcaccaattg 
cgttgatcag 
tcaccatgag 
gggtggccag 
cgtactcgac 
gactgtgcaa 
aaacggcaca 
cactaactgg 
aagagccggt 
ttgctgccgc 
agaaaatagc 
gctccctctt 
ttcggcaaac 
cacagcccga 
atcgcccctc 
gcccccgcgg 
caccattgcc 
ctgttacgac 
tgtgggtgga 
cgttactgtg 
tcctggtgtc 
gagaccatcc 
tggaggttcc 
ctgggagcaa 
gtatcaacgt 
cttcggtaac 
tattcctttc 
caagtggatc 
taccatgaga 
cattgagaac 
aaaccctcca 
ctcctctcca 
gggcatcaag 
cttcttcacc 
tgacccagtc 
taccaccaac 
ggctgacgat 
actgatgcat 
caagttcttc 
tgtttcccca 
agacatgtgg 
gtcgtttgct 
tgccaaggac 
caacttgtac 
gcacgtgacc 
tgaggctatt 
ctgttcgatg 
cagattgaac 
taacgttgga 
ggtcgccgag 



ctcgttggag 
ctgtctgtcg 
gtagtacttg 
ggcgagcagc 
atcgtagaga 
tttgttgtgg 
gtcgttgaac 
gtagaacgcc 
gaccaatctg 
cagctcgagg 
gtcgtccact 
caagcccttg 
tctgtatctg 
ggacgaacgg 
aacggcagat 
gtgtagacta 
acgcaatcag 
ccgccgcgcg 
ctatttcgat 
ttgggccaac 
aatagacaga 
atgtcggagc 
cctgcgtctc 
ggcagacagc 
atagtaagaa 
ttacgactta 
ggctcatgcg 
gctctcatcg 
tatccaagaa 
aagcacctga 
tccatcaact 
gagggatgga 
ccttgcaaca 
tacacctacc 
aacgacgatc 
aacagggatc 
aacaagagca 
ggcgttgctg 
agcaggatct 
ttggtgctgc 
ccgatcgtcg 
ccatactatt 
gctcaaaagt 
ggtatcgagg 
gacttcatcc 
tactctgtca 
accatgttcc 
gacccatacg 
cctatggttt 
ggtgaagtca 
ttggatctcg 
cacggttcct 
tgcaaccagg 
gtcaagtaca 
gctccacaag 
gtgtacgaag 
tgtaatactt 
gacctgggat 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3 78 0 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 
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actcaggatc 

agggtctggc 

ttctgttttt 
ctctctatag 
atacggggcg 
tttataacga 
taggctaact 
tgcgcaaaaa 
atctttgtca 
aaatcgagga 
cggactctgg 
gagtgtgcat 
gtgaaattgt 
acgttccgca 
tgcccagcga 
ataaaagata 
ggcacttaat 
aggaggtaaa 
gaaaaaaggc 
cctatttgga 
acgtcaagca 
gggctattca 
tttcggctat 
gagttgcgcg 
ttgtcttttg 
tgattgtcaa 



tgatctcgcc 
tagattctaa 
ctctacgatt 
ttttgactgt 
tctggtcacg 
aaaaaagagt 
caggaagcat 
ggtgctgatg 
agagggagcc 
aatgaagcct 
tatctctcga 
gggccaacag 
tcacggaaaa 
gggagttgct 
gttggaggtg 
cactattgaa 
gatcaagaac 
acaaggcggt 
gtacgaagag 
gctatgtggt 
aggaaagcct 
gatgggtgcc 
atccgttttg 
tcagacggtt 
caagtaccag 
gatgctttca 



atgaccattc 
gagacggtgc 
ctacttgatg 
aatgaaaata 
tgatgtatca 
ctaaatgaaa 
tccgagcaac 
atcgacaatt 
gatgtcgagg 
gacattatag 
aagactattg 
tgcatttacg 
acctcttctg 
gtgacgagat 
actgcccgta 
ggcgttcagt 
attttgaaga 
gccaagaagg 
attgaaaaac 
gccccagacg 
gccattttga 
aatgctgcaa 
acagagccaa 
ggaaaactcc 
attctggagg 
ggagctgagt 



caaacttcaa 
ccgactcttg 
atgtatgacg 
gtctacatga 
cgtgatcttt 
aaaaatcgat 
gcataatgcc 
acgactcgtt 
tctatcgtaa 
tgatttcgcc 
agattttcaa 
aggttttcgg 
tgacccacga 
accattcgtt 
ccactaacgg 
ttcacccgga 
gtagcggtgg 
agtcgatttt 
agccaggtcg 
ttttgaactt 
gtgaaatcaa 
aacaggcgta 
actggttcaa 
aacaccgtcc 
ccagactggc 
tgcgcgaact 



gctaggtact 
ttcaattctt 
agtgaagatt 
atgaaagaga 
aagttttcga 
ctctgccaaa 
ctcaaccaca 
cacatggaac 
cgacaagatc 
aggccccgga 
gggccggatt 
gggagacgtt 
caatcgtgga 
ggctggaacg 
tatcattatg 
gtccattttg 
ttactggaac 
agacaagatt 
ctcgtttgcc 
ctacgaccgg 
gagagcctcg 
cacctatgcc 
aggaacgatt 
gtgcattttg 
gggcgcagac 
ggttggctat 



tacgaggaga 
ttggttcttt 
gtgttttttt 
tagctgacca 
aatgactaaa 
gactcatcga 
gtctcagaga 
ttgtacgagt 
acaattgaag 
catccgagat 
cctgtttttg 
gagtacgctg 
gtcttcaaga 
ctaaaaactt 
ggtgtcaggc 
acagaagagg 
gaggaggagg 
tacggcgaga 
gatttggagg 
ttaaatgaga 
ccttcgaaag 
acggcggggg 
gaggatttac 
cgcaaggagt 
actgttttgc 
tcccggg 



4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5817 



<210> 78 
<211> 663 
<212> PRT 

<213> Ogataea minuta 
<400> 78 

Met Ala lie Pro Asp Glu Phe Asp lie lie Val Val Gly Gly Gly Ser 
15 10 15 

Cys Gly Cys Ala lie Ala Gly Arg Leu Gly Asn Leu Asp Pro Asp Val 
20 25 30 

Thr Val Ala Leu lie Glu Gly Gly Glu Asn Asn lie Asn Asn Pro Trp 
35 40 45 

Val Tyr Leu Pro Gly Val Tyr Pro Arg Asn Met Arg Leu Asp Ser Lys 
50 55 60 

Thr Ala Thr Phe Tyr Asn Ser Arg Pro Ser Lys His Leu Asn Gly Arg 
65 70 75 80 

Arg Ala lie Val Pro Cys Ala Asn lie Leu Gly Gly Gly Ser Ser lie 
85 90 95 

Asn Phe Leu Met Tyr Thr Arg Ala Ser Ala Ser Asp Tyr Asp Asp Trp 
100 105 110 



Glu Gin Glu Gly Trp Thr Thr Asp Glu Leu Leu Pro Leu Met Lys Lys 
115 120 125 
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Leu Glu Thr Tyr Gin Arg Pro Cys Asn Asn Arg Glu Val His Gly Phe 
130 135 140 

Asp Gly Pro lie Lys Val Ser Phe Gly Asn Tyr Thr Tyr Pro Thr Ala 
145 150 155 160 

Gin Asp Phe Leu Arg Ala Cys Glu Ser Gin Gly lie Pro Phe Asn Asp 
165 170 175 

Asp Leu Glu Asp Leu Lys Ala Ser His Gly Ala Glu Tyr Trp Leu Lys 
180 185 190 

Trp lie Asn Arg Asp Leu Gly Arg Arg Ser Asp Ser Ala His Ala Tyr 
195 200 205 

lie His Pro Thr Met Arg Asn Lys Ser Asn Leu Phe Leu lie Thr Ser 
210 215 220 

Thr Lys Ala Asp Lys Val lie lie Glu Asn Gly Val Ala Val Gly Val 
225 230 235 240 

Arg Thr Val Pro Met Lys Pro Val Glu Thr Lys Asn Pro Pro Ser Arg 
245 250 255 

lie Phe Lys Ala Arg Lys Gin lie Val Val Ser Cys Gly Thr lie Ser 
260 265 270 

Ser Pro Leu Val Leu Gin Arg Ser Gly lie Gly Ala Ala His Lys Leu 
275 280 285 

Arg Gin Ala Gly lie Lys Pro lie Val Asp Leu Pro Gly Val Gly Glu 
290 295 300 

Asn Phe Gin Asp His Tyr Cys Phe Phe Thr Pro Tyr Tyr Ser Lys Pro 
305 310 315 320 

Glu Val Pro Thr Phe Asp Asp Phe Val Arg Gly Asp Pro Val Ala Gin 
325 330 335 

Lys Ser Ala Phe Asp Gin Trp Tyr Ser Asn Lys Asp Gly Pro Leu Thr 
340 345 350 

Thr Asn Gly lie Glu Ala Gly Val Lys lie Arg Pro Thr Asp Gl-u Glu 
355 360 365 

Leu Ala Thr Ala Asp Asp Asp Phe lie Gin Gly Tyr His Glu Tyr Phe 
370 375 380 

Asp Asn Lys Pro Asp Lys Pro Leu Met His Tyr Ser Val lie Ser Gly 
385 390 395 400 

Phe Phe Gly Asp His Thr Lys lie Pro Asn Gly Lys Phe Phe Thr Met 
405 410 415 



Phe His Phe Leu Glu Tyr Pro Phe Ser Arg Gly Phe Val Tyr Ala Val 
420 425 430 
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Ser Pro Asp Pro Tyr Glu Ala Pro Asp Phe Asp Pro Gly Phe Leu Asn 
435 440 445 

Asp Ser Arg Asp Met Trp Pro Met Val Trp Ser Tyr Lys Lys Ser Arg 
450 455 460 

Gin Thr Ala Arg Arg Met Glu Ser Phe Ala Gly Glu Val Thr Ser His 
465 470 475 480 

His Pro Leu Tyr Pro Val Asp Ser Pro Ala Arg Ala Lys Asp Leu Asp 
485 490 495 

Leu Glu Thr Cys Lys Ala Phe Ala Gly Pro Asn His Phe Thr Ala Asn 
500 505 510 

Leu Tyr His Gly Ser Trp Thr Val Pro lie Glu Lys Pro Thr Pro Lys 
515 520 525 

Asn Asp Ser His Val Thr Cys Asn Gin Val Glu lie Phe Ser Asp lie 
530 535 540 

Asp Tyr Ser Ala Glu Asp Asp Glu Ala lie Val Lys Tyr lie Lys Glu 
545 550 555 560 

His Thr Glu Thr Thr Trp His Cys Leu Gly Thr Cys Ser Met Ala Pro 
565 570 575 

Gin Glu Gly Ser Lys lie Ala Pro Lys Gly Gly Val Val Asp Ala Arg 
580 585 590 

Leu Asn Val Tyr Glu Val Lys Asn Leu Lys Val Ala Asp Leu Ser lie 
595 600 605 

Cys Pro Asp Asn Val Gly Cys Asn Thr Tyr Ser Thr Ala Leu Leu lie 
610 615 620 

Gly Glu Lys Ala Ala Thr Leu Val Ala Glu Asp Leu Gly Tyr Ser Gly 
625 630 635 640 

Ser Asp Leu Ala Met Thr lie Pro Asn Phe Lys Leu Gly Thr Tyr Glu 
645 650 655 

Glu Lys Gly Leu Ala Arg Phe 
660 



<210> 79 
<211> 2348 
<212> DNA 

<213> Ogataea minuta 



<400> 79 

aagctttctt 

aggcgcactt 

tttgcgatta 

ggctagcgat 

aagatccgaa 

tcaatgtagc 



tcgcaaacag 
taaaagataa 
cgtaagccac 
agaatgtatc 
gaaagggagc 
tgttcaggtc 



ctctttggta 
tctacatcca 
aaaatttcag 
agtgttatta 
aagaaggcca 
tttcaacagg 



gaggagaata 
gaaaaataaa 
gaaactcgta 
gtggctctag 
cgccagacgt 
ttcttggtct 



gagtgcccag 
aaaataaaac 
caagatcagg 
gagtagaaaa 
tctagtaggt 
cgtctggact 



ctgataaaga 6 0 
tgaaccggca 12 0 
ttggcgaggg 180 
caatagaata 24 0 
agcccaatcg 300 
ggagatccaa 3 60 
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caagtcgttg 
ccgacgtgca 
tggaggattt 
ttgaaagcgc 
cgcagaacca 
ccaacggcac 
gagaagatct 
atgttttcaa 
tagcggaata 
gtgagctttg 
gagccgtcgg 
ttgatgagat 
gtgatgtctc 
aaagtgttgg 
gagggcgcta 
ttctccaggg 
gacatttcag 
tcatggaaga 
gcggagatcc 
atccactcaa 
gttgtcattt 
ttcctacttg 
gactgcaaat 
agagttcgag 
ggaaatagcc 
cccgcgctgg 
ttcgcgtagc 
tgacagtttg 
ttagagacct 
ccgtgtgacg 
cactctgggc 
aaactgcgta 
aagaagcaga 
cgaaaaaa 



ctgcggttcg 
gggtgttttc 
tcaccccgta 
accatctgat 
cctcttccag 
gctcctttct 
cctgcgcaaa 
gcagccattt 
ggtagtttgc 
gggtggtctt 
agtagcccgc 
ggggcaagat 
cgtcaaaagt 
agagctcgag 
caagtctcct 
gcaatcgggt 
gcattcccgc 
tttccgcgta 
cattggactt 
tcagtccgtc 
gcgggagggg 
aatgctaggc 
gacgacggta 
ccggggaggg 
gtcaaaagaa 
ctttccccgg 
ccctcggttg 

tggggtaaca 

actacggagt 
acacccccgc 
atgagcagtg 
taaatagcta 
tcactcaact 



actggcatag 
gggcttgatg 
ggacttgtcg 
tttcttagag 
ctgctcgcga 
gatgatgatg 
gtccagggtc 
gtctttggag 
ttcgcctccc 
cacctcgtct 
ggcagtgacg 

c 99 c 9 ac 9 a 9 
cacgagctgg 
aagttgcgcc 
tttgctgatt 
gaagaaacag 
ctgggtcatc 
gcgtcgcttc 
gactgcatag 
tcggcggtgc 
tgtaaagcag 
tggctaattg 
aacagaatcc 
agtgacgatg 
aaacagaagg 
ccggggaggc 
cctattgagg 
acggtgtccg 
ttcagtgtct 
atggcttttg 
gtgtgcctgg 
cttccatctc 
tgttcaaaga 



tcgttggcgc 
gttcggttgt 
ccaatatcga 
gaagggaagt 
acgagtttgt 
gcatgcaatg 
cggatgatgt 
aagaactcga 
atcaccagaa 
atgagcccct 
atgccgacat 
gcgtcgaagt 
agtctgcggt 
gtgttcagaa 
gtgcggcgtc 
ccaacggaag 
tcgatgttgt 
gcttccgaat 
agaacaaacg 
tccttgagcg 
ctcagccggt 
ggtatggcac 
actcagctgg 
cagccagaaa 
ggctgcagtt 
agccggctaa 
gttacttttc 
aggccagcta 
tcggaagctg 
gcaatctcac 
tgtatctctg 
ctacttgttg 
ctcttgtgtt 



cgagggagct 
cgttgcagct 
cccagatgtc 
ggcggagacg 
agccttcttt 
aaagtttctt 
cttcctcgga 
gtccgccaag 
cgttctggcg 
tgagtcgggc 
agaggtcttt 
tggagccgtc 
gtacggatgt 
tgagccgaat 
cgtcctcgat 
gcaccaattg 
cgttgatcag 
tcaccatgag 
gggtggccag 
cgtactcgac 
gactgtgcaa 
aaacggcaca 
cactaactgg 
aagagccggt 
ttgctgccgc 
agaaaatagc 
gctccctctt 
ttcggcaaac 
cacagcccga 
atcgcccctc 
gcccccgcgg 
caccattgcc 
ctgttacgac 



gaactggtcg 
gaggaactct 
cgagcctccg 
tttgtctacg 
ggggaccagt 
gatcaggtcc 
ccagtcgagc 
ctcgttggag 
ctgtctgtcg 
gtagtacttg 
ggcgagcagc 
atcgtagaga 
tttgttgtgg 
gtcgttgaac 
gtagaacgcc 
gaccaatctg 
cagctcgagg 
gtcgtccact 
caagcccttg 
tctgtatctg 
ggacgaacgg 
aacggcagat 
gtgtagacta 
acgcaatcag 
ccgccgcgcg 
ctatttcgat 
ttgggccaac 
aatagacaga 
atgtcggagc 
cctgcgtctc 
ggcagacagc 
atagtaagaa 
ttacgactta 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2348 



<210> 80 

<211> 802 

<212> DNA 

<213> Ogataea minuta 



<400> 80 

gagacggtgc 

ctacttgatg 

aatgaaaata 

tgatgtatca 

ctaaatgaaa 

tccgagcaac 

atcgacaatt 

gatgtcgagg 

gacattatag 

aagactattg 

tgcatttacg 

acctcttctg 

gtgacgagat 

actgcccgta 



ccgactcttg 
atgtatgacg 
gtctacatga 
cgtgatcttt 
aaaaatcgat 
gcataatgcc 
acgactcgtt 
tctatcgtaa 
tgatttcgcc 
agattttcaa 
aggttttcgg 
tgacccacga 
accattcgtt 
ccactaacgg 



ttcaattctt 
agtgaagatt 
atgaaagaga 
aagttttcga 
ctctgccaaa 
ctcaaccaca 
cacatggaac 
cgacaagatc 
aggccccgga 
gggccggatt 
gggagacgtt 
caatcgtgga 
ggctggaacg 
ta 



ttggttcttt 
gtgttttttt 
tagctgacca 
aatgactaaa 
gactcatcga 
gtctcagaga 
ttgtacgagt 
acaattgaag 
catccgagat 
cctgtttttg 
gagtacgctg 
gtcttcaaga 
ctaaaaactt 



ttctgttttt 
ctctctatag 
atacggggcg 
tttataacga 
taggctaact 
tgcgcaaaaa 
atctttgtca 
aaatcgagga 
cggactctgg 
gagtgtgcat 
gtgaaattgt 
acgttccgca 
tgcccagcga 



ctctacgatt 60 
ttttgactgt 120 
tctggtcacg 180 
aaaaaagagt 24 0 
caggaagcat 3 00 
ggtgctgatg 360 
agagggagcc 42 0 
aatgaagcct 480 
tatctctcga 540 
gggccaacag 600 
tcacggaaaa 660 
gggagttgct 72 0 
gttggaggtg 78 0 
802 
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<210> 81 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 0AP5 for production of an expression 
cassette with A0X1 gene promoter and terminator 

<400> 81 

ctgcagcccc ttctgttttt cttttgacgg 30 



<210> 82 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 0AP3 for production of an expression 
cassette with AOX1 gene promoter and terminator 

<400> 82 

cccccggatc caggaacccg ggaacagaat ctagattttt tcgtaagtcg taagtcgtaa 60 
cagaacacaa gagtctttga acaagttgag 90 



<210> 83 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 0AT5 for production of an expression 
cassette with AOX1 gene promoter and terminator 

<400> 83 

cccccccgga tccgagacgg tgcccgactc ttgttcaatt cttttgg 47 



<210> 84 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer OAT3 for production of an expression 
cassette with AOX1 gene promoter and terminator 



<400> 84 

cccataatgg taccgttagt ggtacgggca gtc 



33 



47 



<210> 85 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer HGP5 for amplification of a gene conferring 
resistance against hygromycin B 

<400> 85 

gtcgacatga aaaagcctga actcaccgc 29 



<210> 86 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer HGP3 for amplification of a gene conferring 
resistance against hygromycin B 

<400> 86 

actagtctat tcctttgccc tcggacg 27 



<210> 87 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer for amplification of 5 ' -region of 
f i-mannosidase gene 

<400> 87 

999999 tc 9 a catggtggtc ttcagcaaaa ccgctgccc 3 9 



<210> 88 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer for amplification of 5' -region of 
f i-mannosidase gene 



<400> 88 

ggggggcggc cgcgtgatgt tgaggttgtt gtacggaacc ccc 



43 
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<210> 89 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer for amplification of Saccharomyces 
cerevisiae SUC2 gene 

<400> 89 

ggggactagt atgcttttgc aagctttcct tttccttttg 40 



<210> 90 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer for amplification of Saccharomyces 
cerevisiae SUC2 gene 

<400> 90 

ccccagatct tattttactt cccttacttg gaacttgtc 39 



<210> 91 

<211> 711 

<212> DNA 

<213> Homo sapiens 



<400> 91 

ctcaccatga 

cgatgtgaca 

gtcaccatca 

aaaccaggga 

ccatcaaggt 

cagcctgaag 

ggccaaggga 

ccgccatctg 

ttctatccca 

tcccaggaga 

ctgacgctga 

cagggcctga 



gggtccccgc 
tccagatgac 
cttgtcgggc 
aagcccctaa 
tcagcggcag 
attttgcaac 
ccaaggtgga 
atgagcagtt 
gagaggccaa 
gtgtcacaga 
gcaaagcaga 
gctcgcccgt 



tcagctcctg 
ccagtctcca 
gagtcaggtt 
gctcctgatc 
tggatctggg 
ttactattgt 
aatcaaacgt 
gaaatctgga 
agtacagtgg 
gcaggacagc 
ctacgagaaa 
cacaaagagc 



gggctcctgc 
tcttccgtgt 
attagcagct 
tatgctgcat 
acagatttca 
caacaggcta 
acggtggctg 
actgcctctg 
aaggtggata 
aaggacagca 
cacaaagtct 
ttcaacaggg 



tgctctggct 
ctgcatctgt 
ggttagcctg 
ccagtttgca 
ctctcaccat 
acagtttccc 
caccatctgt 
ttgtgtgcct 
acgccctcca 
cctacagcct 
acgcctgcga 
gagagtgttg 



cccaggtgca 
aggagacaga 
gtatcagcag 
aagtggggtc 
cagcagcctg 
tccgacgttc 
cttcatcttc 
gctgaataac 
atcgggtaac 
cagcagcacc 
agtcacccat 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

711 



<210> 92 

<211> 234 

<212> PRT 

<213> Homo sapiens 



<400> 92 

Met Arg Val Pro Ala Gin Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro 
15 10 15 
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Gly Ala Arg Cys Asp lie Gin Met Thr Gin Ser Pro Ser Ser Val Ser 
20 25 30 

Ala Ser Val Gly Asp Arg Val Thr lie Thr Cys Arg Ala Ser Gin Val 
35 40 45 

lie Ser Ser Trp Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro 
50 55 60 

Lys Leu Leu lie Tyr Ala Ala Ser Ser Leu Gin Ser Gly Val Pro Ser 
65 70 75 80 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser 
85 90 95 

Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin Ala Asn 
100 105 110 

Ser Phe Pro Pro Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys Arg 
115 120 125 

Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp Glu Gin 
130 135 140 

Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr 
145 150 155 160 

Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser 
165 170 175 

Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
180 185 190 

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys 
195 200 205 

His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser Pro 
210 215 220 

Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 



<210> 93 
<211> 1428 
<212> DNA 

<213> Homo sapiens 
<400> 93 

atgggttgga gcctcatctt 
gtgcagctgg tggagtctgg 
tgtgcagcct ctggattcac 
gggaaggggc tggagtgggt 
gactcagtga agggccgatt 
caaatgaaca gcctgagagc 
attatggttc ggggagtcta 
gtcaccgtct cctcagctag 
aagagcacct ctgggggcac 



gctcttcctt gtcgctgttg 
gggaggcctg gtcaagcctg 
cttcagtagc tatagcatga 
ctcatccatt agtagtagta 
caccatctcc agagacaacg 
cgaggacacg gctgtgtatt 
ctactactac ggtatggacg 
caccaagggc ccatcggtct 
agcggccctg ggctgcctgg 



ctacgcgtgt ccagtctgag 60 
gggggtccct gagactctcc 120 
actgggtccg ccaggctcca 180 
gtagttacat atactacgca 240 
ccaagaactc actgtatctg 300 
actgtgcgag agatcggatt 360 
tctggggcca agggaccacg 42 0 
tccccctggc accctcctcc 480 
tcaaggacta cttccccgaa 540 
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ccggtgacgg 
gtcctacagt 
ttgggcaccc 
aagaaagttg 
gaactcctgg 
atctcccgga 
gtcaagttca 
gaggagcagt 
tggctgaatg 
gagaaaacca 
ccatcccggg 
tatcccagcg 
accacgcctc 
gacaagagca 
cacaaccact 



tgtcgtggaa 
cctcaggact 
agacctacat 
agcccaaatc 
ggggaccgtc 
cccctgaggt 
actggtacgt 
acaacagcac 
gcaaggagta 
tctccaaagc 
atgagctgac 
acatcgccgt 
ccgtgctgga 
ggtggcagca 
acacgcagaa 



ctcaggcgcc 
ctactccctc 
ctgcaacgtg 
ttgtgacaaa 
agtcttcctc 
cacatgcgtg 
ggacggcgtg 
gtaccgtgtg 
caagtgcaag 
caaagggcag 
caagaaccag 
ggagtgggag 
ctccgacggc 
9999aacgtc 
gagcctctcc 



ctgaccagcg 
agcagcgtgg 
aatcacaagc 
actcacacat 
ttccccccaa 
gtggtggacg 
gaggtgcata 
gtcagcgtcc 
gtctccaaca 
ccccgagaac 
gtcagcctga 
agcaatgggc 
tccttcttcc 
ttctcatgct 
ctgtctccgg 



gcgtgcacac 
tgaccgtgcc 
ccagcaacac 
gcccaccgtg 
aacccaagga 
tgagccacga 
atgccaagac 
tcaccgtcct 
aagccctccc 
cacaggtgta 
cctgcctggt 
agccggagaa 
tctacagcaa 
ccgtgatgca 
gtaaatga 



cttcccggct 
ctccagcagc 
caaggtggac 
cccagcacct 
caccctcatg 
agaccctgag 
aaagccgcgg 
gcaccaggac 
agcccccatc 
caccctgccc 
caaaggcttc 
caactacaag 
gctcaccgtg 
tgaggctctg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1428 



<210> 94 
<211> 475 
<212> PRT 

<213> Homo sapiens 
<400> 94 

Met Gly Trp Ser Leu lie Leu Leu Phe Leu Val Ala Val Ala Thr Arg 
15 10 15 

Val Gin Ser Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Lys 
20 25 30 

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe 
35 40 45 

Ser Ser Tyr Ser Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 
50 55 60 

Glu Trp Val Ser Ser lie Ser Ser Ser Ser Ser Tyr lie Tyr Tyr Ala 
65 70 75 80 

Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Asn 
85 90 95 

Ser Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val 
100 105 110 

Tyr Tyr Cys Ala Arg Asp Arg lie lie Met Val Arg Gly Val Tyr Tyr 
115 120 125 

Tyr Tyr Gly Met Asp Val Trp Gly Gin Gly Thr Thr Val Thr Val Ser 
130 135 140 

Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser 
145 150 155 160 

Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp 
165 170 175 



Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr 
180 185 190 
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Ser Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr 
195 200 205 

Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin 
210 215 220 

Thr Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 
225 230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro 
245 250 255 

Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro 
260 265 270 

Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr 
275 280 285 

Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn 
290 295 300 

Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg 
305 310 315 320 

Glu Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val 
325 330 335 

Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser 
340 345 350 

Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys 
355 360 365 

Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp 
370 375 380 

Glu Leu Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe 
385 390 395 400 

Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu 
405 410 415 

Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe 
420 425 430 

Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly 
435 440 445 

Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr 
450 455 460 " 



Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
465 470 475 
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<210> 95 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 95 

Val Gly Phe Leu Asp His Met 
1 5 



<210> 96 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 96 

Pro Ser Thr Lys Gly Val Leu 
1 5 



<210> 97 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer PHI5 for amplification of Ogataea minuta 
HIS3 gene 

<220> 

<221> modif ied_base 
<222> (2) 

<223> a, c, gort 
<220> 

<221> modif ied_base 
<222> (5) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (11) 

<223> a, c, g or t 

<400> 97 

tnggnttyyt ngaycayatg 

<210> 98 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer PHI3 for amplification of Ogataea minuta 
HIS3 gene 
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<220> 

<221> modif ied_base 
<222> (3) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (18) 

<223> a, c, gort 

<400> 98 

arnacnccyt tngtnswngg 20 



<210> 99 

<211> 3831 

<212> DNA 

<213> Ogataea minuta 



<400> 99 

ctgcagatgc 

ttttggaaca 

ccagtgaaga 

aggaaccagc 

tagagaccat 

aattgggcga 

acctgcttaa 

aggacgcgtt 

aacagagcgt 

gcggcccgcg 

tgcttgacaa 

atcacattgt 

agcccgaggg 

tcaacctgcc 

tgcagaaggc 

cggtggaaga 

aggcattgct 

ctcttgagca 

aaacagctga 

gcacttaatc 

tcattgcttg 

cgtcgtactt 



gccgttgctg 
gattgacccg 
tatccgggac 
gtccaacaca 
taacagtcca 
gctggaactg 
atcgatcaag 
gatgatgaag 
tcaccaggcg 
cgccagtcca 
aacgacgcct 
ttccagatgc 
ggttgaacag 
ttcccagaaa 
gattgaaacc 
cgccgagggt 
ggagaacgag 
aatcgaagaa 
ccttgttgat 
catatttttg 
cagttttttg 
cttagtggcc 



ctgagcaaag 
aagttggagg 
ggttggaaca 
aaaagcgact 
aaattgtcga 
ttggaagaga 
tttcttgaga 
tgttttgact 
ctgattctgc 
caccagaagg 
gcctcgcggg 
gagattatca 
atacagctga 
accccggagt 
gaaaaattgg 
gttttggaac 
gaggagttca 
gagcgaccac 
tgaagctcaa 
agaggaggca 
agaatctcag 
caatagtaga 



tgaaagagca 
aactgaacaa 
gctcttttat 
ctgcttcgtc 
aagaaccgac 
ccgaacgatt 
gacatctata 
acgagctgga 
aatatttgga 
agcaggctat 
cttttgaggc 
agcaagaaca 
gatccatgga 
acgaggcttt 
acgaaatcaa 
tgtttgaccg 
accagttgaa 
aaaccgcaaa 
attacgctaa 
ttctaagaat 
agttcttcca 
tatcccagtc 



cagagccaaa 
agaaaaagaa 
caacaagaaa 
tgtcaagaag 
cccgtccaaa 
cgcccagatc 
catagtgagc 
cggtgactcc 
tgatctgttc 
tggtctgttc 
cgattggaag 
cgaacaagag 
ccccaactcg 
caagcaacta 
ccgtgttttt 
ctgcggggtg 
acatgagtac 
ggaagagaac 
acgatatata 
ctctctagtt 
cttctcaggc 
cggctcatca 



attgcctctg 
gctcacattt 
tctgagattg 
accaaggcga 
ccgttagacc 
tcgtctcaag 
gagcagcaga 
cagcgtgcca 
aaagccgctg 
at tgggaaac 
aagacttatc 
ggccaagaag 
gagctggtga 
ccggagccaa 
gcctccatgt 
attcagatac 
gagggagaac 
gcttttatac 
catgtcaatt 
ttgttctcgc 
agcggggtga 
agcaatttgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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cgtactcttc 
agagcaagtc 
ttgtttccac 
cctcctcagt 
cgaatcttct 
aactagcctt 
ggaaagcagc 
ttaaaaaaaa 
ggagtcatcc 
gcaagctgga 
tcaccaatga 
aagagtcgat 
cacaggtgat 
ccaaacactc 
atcacactgc 
ctgtcaaagg 
gcgcggttgt 
aaaaaatcgg 
gcgcgcacat 
aaagcgcgtt 
atgatgttcc 
tattcagccc 
ccgtattctc 
agattgggta 
actacagcca 
ccgtttgtct 
tagtctgatc 
atgttgaacg 
gcaggcgctc 
gcccgtctga 
tccaaggtgt 
ccccatttgc 
caatctccca 
tcgtactcgc 
acggacgagg 
agcgtacaca 
ggaaaagata 
gtccgggcca 
gggtccaaag 
tttctaggcc 
cagatgctgc 
gactcagttc 



cagttggtcc 
agattcaaga 
gtcctcattg 
taacatggtg 
tctggtcaca 
gtctagggag 
gacagatcgg 
gagtcaaaaa 
gacggtatcc 
aaacggcaca 
aaccagcatc 
tcttggggcc 
ctccatcaag 
tggttggtca 
ggaagacgtc 
tctcaaacga 
tgatctgtct 
ggaccttagt 
cacgatgcac 
caaggcattg 
aagtacgaag 
tctccctctt 
acgtagccgc 
ttcaaatatc 
caatttgcac 
tgtcctttct 
ccacacgcag 
gtccgtcggt 
tgggagttta 
cctcccctat 
ccgaggtttc 
cctcttctcc 
caactccgtt 
gacagctggg 
aactcatggt 
gagagccttc 
aggtagggaa 
ggaagacgcc 
agaacgttct 
agtttgagtc 
gagccggtac 
agatactcag 



gcagtcatcg 
attccccgct 
tctcttggga 
tgggtgtttg 
aatcctgtca 
tagttaacat 
ccgaccgcga 
aacatcctga 
ctttacagtc 
aatgatgcaa 
cagatcatcc 
acttacgaaa 
acaggactcg 
ttgattgtcg 
ggcattgcac 
ttcgggcacg 
aaccgtccat 
tgcgagatga 
gtcgattgtc 
gccgttgcca 
ggagttttga 
catcgtgtac 
aaccgaaagc 
gattttaggc 
cgccaccgaa 
tgtcttaatg 
ccgtcaactg 
agactcttct 
ttcctactac 
acgatccggt 
cggcccaatc 
gacgcctcgc 
caagacagca 
ctcgccgccc 
gacgagccgc 
gcagctggaa 
gtgtcttcaa 
cttcctgtcc 
gttgcagctg 
caaactcgtc 
tgttccgttc 
cacaaacgat 



aagccaagta 
ttcttgactg 
ttggctcgac 
ctccctggcc 
gcttaagcat 
ttgtaatgga 
ctatcgacgc 
agagaaactt 
aaaaaaacat 
aagctgctcg 
tgaacttgga 
aagagagcca 
gtttccttga 
agtgtattgg 
tgggtgaaac 
catacgctcc 
ttgccgtggt 
ttccccacgt 
ttcgcgggtt 
tcagagatgc 
tgtaactcca 
agtttaacca 
cctggaaagc 
aattgtaccc 
aaatttcctg 
cgtctggttt 
aaccgccaca 
tcttcgaccc 
tacgagaacc 
tctcctcaaa 
aacttaatca 
atgtcccgtt 
gatatgccgt 
aaactcgacc 
gagctcattc 
ctgtggatca 
tacggaatac 
gacttcaagc 
gtgcggaaac 
gggcttacca 
aaggtaatca 
aaggcgtctt 



cttgtccgca 
gtaaaccagt 
cctgatcttc 
aagtctggcc 
tttgtccgtt 
tcttttcgtt 
gacgataggg 
ttcatgggga 
gtctgaagag 
gtttgcagag 
tggtggactc 
cgcggcacaa 
ccacatgctg 
ggacctgcac 
gttcaagcgc 
actggatgag 
ggaacttggc 
tttggagagt 
caatgaccac 
cacgagctac 
cacccaggag 
tactagacta 
agatagtcta 
gtcttgagcc 
cgtcgcgctt 
gcttgttttc 
ggaatcccgc 
cgtctgccct 
agcgtgagtc 
ctaagctcac 
actcaatgtc 
ctaacggcca 
cacgagtcag 
tggtgaaccc 
aggacgcaat 
aagtgatgtc 
ggattctgat 
tcacgggcgt 
cagaactgct 
aaatcctctc 
aactgttcgg 
tgaagctgca 



aatctactca 
cgtcttctct 
agctctccgt 
accgaggttg 
gtgagaccaa 
ttctttgctc 
attttttttt 
accatgtcca 
aacaagaaac 
gtgagaagag 
atcgagtgca 
aatacgagtg 
cacgcccttg 
attgacgacc 
gcgttgggac 
gctctaagtc 
ctgcgcagag 
tttgctacga 
caccgcagtg 
acgggccgtg 
cttctactaa 
accaactccg 
accaatcgag 
gccacccgcc 
tgtttcccct 
ttctttgccg 
ttcgtcgcga 
gtataaagag 
cttcaacaga 
tgccagaccg 
gtctcctata 
cgccaatgga 
cacgccgtcg 
tgccccggtg 
gcgctcaaga 
gttgctcacg 
cgcatactcg 
tgacttcacg 
ggtgattctg 
catctacaga 
cagactgaca 
9 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3831 



<210> 100 
<211> 238 
<212> PRT 

<213> Ogataea minuta 
<400> 100 

Met Ser Glu Glu Asn Lys Lys Arg Lys Leu Glu Asn Gly Thr Asn Asp 
1 5 10 15 

Ala Lys Ala Ala Arg Phe Ala Glu Val Arg Arg Val Thr Asn Glu Thr 
20 25 30 

Ser lie Gin lie lie Leu Asn Leu Asp Gly Gly Leu lie Glu Cys Lys 
35 40 45 
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Glu Ser lie Leu Gly Ala Thr Tyr Glu Lys Glu Ser His Ala Ala Gin 
50 55 60 

Asn Thr Ser Ala Gin Val lie Ser lie Lys Thr Gly Leu Gly Phe Leu 
65 70 75 80 

Asp His Met Leu His Ala Leu Ala Lys His Ser Gly Trp Ser Leu lie 
85 90 95 

Val Glu Cys lie Gly Asp Leu His lie Asp Asp His His Thr Ala Glu 
100 105 110 

Asp Val Gly lie Ala Leu Gly Glu Thr Phe Lys Arg Ala Leu Gly Pro 
115 120 125 

Val Lys Gly Leu Lys Arg Phe Gly His Ala Tyr Ala Pro Leu Asp Glu 
130 135 140 

Ala Leu Ser Arg Ala Val Val Asp Leu Ser Asn Arg Pro Phe Ala Val 
145 150 155 160 

Val Glu Leu Gly Leu Arg Arg Glu Lys lie Gly Asp Leu Ser Cys Glu 
165 170 175 

Met lie Pro His Val Leu Glu Ser Phe Ala Thr Ser Ala His lie Thr 
180 185 190 

Met His Val Asp Cys Leu Arg Gly Phe Asn Asp His His Arg Ser Glu 
195 200 205 

Ser Ala Phe Lys Ala Leu Ala Val Ala lie Arg Asp Ala Thr Ser Tyr 
210 215 220 

Thr Gly Arg Asp Asp Val Pro Ser Thr Lys Gly Val Leu Met 
225 230 235 



<210> 101 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DHI5 

<400> 101 

ggcccaatag tagatatccc 



<210> 102 
<211> 21 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DHI3 

<400> 102 

cacggcccgt gtagctcgtg g 



<210> 103 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 103 

Ala Val Gly Gly Pro Lys Trp Gly 
1 5 



<210> 104 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 104 

Ala Ala Met Met Leu Lys Leu 
1 5 



<210> 105 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer PLE5 for amplification of Ogataea minuta 
LEU2 gene 

<220> 

<221> modif ied_base 

<222> (3) 

<223> a, c # g or t 
<220> 

<221> modif ied_base 

<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (9) 

<223> a, c, g or t 
<220> 

<221> modif ied__base 

<222> (12) 

<223> a, c, g or t 
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<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 

<400> 105 

gcngtnggng gnccnaartg ggg 23 



<210> 106 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer PLE3 for amplification of Ogataea minuta 
LEU2 gene 

<220> 

<221> modif ied_base 
<222> (1) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 
<222> (7) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (16) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (19) 

<223> a, c, g or t 

<400> 106 

naryttnarc atcatngcng c 21 



<210> 107 
<211> 5615 
<212> DNA 

<213> Ogataea minuta 
<400> 107 

ggatccttcc tcgagatccg ggctgttttt 
cgacttctcc tacgacagac tgtgggagtg 
cgagaaagag ctgctcaggt cgaacgccaa 
ggttttcggc gaacaatgga ccaagctgct 
aagacgccag cgggggaagg tggaggagct 
tggaaagagc cagtttatac atccgtacac 
cgacttttca attttttttt ttacttctaa 
aagggtatgt gtgttgtatt tgcagcagtc 
ctgaagttgg ccagaaactg aacagcaacc 



caccgaccca tccgatcctg acttcaaccc 60 
tctcgtcagg gcagaacaga tgaccggtga 12 0 
ttacgttgcg aaaaccaagt ttggaaagaa 18 0 
cgcctttgtt gtcagcttgg agctgtacaa 240 
gtattgaata aggaatgagg agaatggttt 3 00 
cggatctaac aactgttttc acgaaatgca 360 
aattttttat ctctaaaaag ctgtagatct 420 
cacttagcaa gaacacacac acacgaatga 480 
ctgaggttct tctcaagaag agaaagcaag 54 0 



58 



ccgacagact ccgtctggag aaacaggaac 
gaaagaagca ggaaagaaga aaggccaagt 
acagaaccac tgaaagagag cagtacagag 
agaccgaaac cgagtccgcc aagcaagaag 
tcgttgtgag agtgccaggc ccgcacggtg 
tgtcgttgct gcgtcttgaa cacacctaca 
taagacctct tttgagactg atcaacccgt 
ctgtgagaaa cctgatacaa aagagagcta 
ctagagaggt taacttggat gacaacaatc 
tcatttgcgc agaagatctt attcacgaga 
ccgtcaagtt cctgaatcct ttccaactga 
gtaagttaaa gagactcgag ctgagagaag 
gaaacgctcc tttgaacgag gttgacattg 
atttaagtaa gcatgttcga gtaatgacaa 
gtctatcaat ctcgcgtcga tcgacgcgcg 
gccgccagct cctgcgaccg gagaatattg 
gcattaattt ttcacccaca aaatagctag 
gtgaacagaa ttgtgacctt ttatcgcagc 
atagttttgc tgcctggtga ccacgttggc 
ctcaacgcca tttcggccgc caagccggaa 
Sggggtgctg ctatcgacgc cactggccag 
aagaaagcag atgctgtcct gctaggatct 
gttcgtcctg agcaagggtt gctgaagatc 
agaccgtgca gctttgcatc ggacgccttg 
gtcagaggta ccgacttcgt tgttgtcaga 
agaaaggagg acgacggatc aggattcgct 
gttcaaagaa tcaccagaat ggctgctttc 
gtgtattcgc tggacaaggc caacgttctg 
gaagagacta tcaagaacga gtttcctcag 
gccgctatga ttttggtgaa gtccccaacc 
atgtttggag acatcatctc tgacgaagct 
ccgtccgcat ctctggcttc tcttccagac 
tgccacggtt ccgctcccga tctcgccaaa 
tcggccgcca tgatgctcaa gttgtcgctc 
aaggctgtca gagccgttct ggaccaaggc 
tcgaccactg aggttggaga cgctgttgcc 
ggtcaatttt gtcctgatcc ggcagagatt 
cgagcaacct gtccctcgtg tagcagctgt 
agtgagccag ctaccttctt ctataaatag 
cgagaacgta gcccgacgcg cgcctgaaca 
aaaaatctat tgtcggagcg ttttttcaca 
tgtttttgtt ttgttgttat atgtcaccaa 
ttgtcaggaa ctcaatcact ggttcaagtc 
cgtcctgcat ttcatcctct tctgaacggt 
tgtacgacaa gcaacacaag gcaacaccag 
ggcaccgtca attggaacaa cctgttgaac 
gaaacagcat cctcattcaa cacccctgac 
gatgacagcc tcctgcggtc cgttcaagct 
gacgctcgca tccttctcgt acttgggatg 
gctgtcggac tcatacatct accagtcctc 
cgcggtggcc gctctgaccg gcggaaataa 
cgacgccagc agcgactcca aatgctcgac 
cggtggcagc gggtactatt cggtgtatgt 
gggatcgtcc agttcggaaa ccaccagctc 
ctccacctcg agttctacct cgacaagttc 
tttgacgagc tcctcctctt cgtcatcatc 
ggtctctgaa ataatcacca gttcggcttc 
atcaacctca tcatcgtcct cgtcgtcatc 



aggctagaaa gagaatcgaa gacaaaaaga 6 00 
tcatcagagc agagacacta gttgccaagc 660 
tgaaaagagt gacgcagaac gagaaaatca 72 0 
cagctgggga ggatgaatcc aagcttcttt 780 
ccaaggttcc aggaaaagcc agaaaggttc 84 0 
cgggaacttt cataaagtcc aatgccacca 900 
atgtggtgat tggaactcct tcgctggcca 960 
cagtgaccgt gacaagcgaa gacggttctg 1020 
tcatcgaaga gaaattggga gagtgtggta 1080 
tcgtctcttt gggagactac ttcaagccct 1140 
acgctcctgt ccacggctgg ggtccgctca 1200 
agagcaagaa ccacaaggtg aacaacgctg 1260 
accagttcat cgctgagcag atctgagggt 132 0 
gatctgtcca cagtaagatt tgaaataatg 1380 
acgcctggcc gcttccctcc ggtttccggc 1440 
ttttttatct tgatttttcg aggggattga 1500 
atttcggttt tcaggagcta cagagtcatc 156 0 
ttttttacat tcagaatgac cacaaagaac 1620 
ccggaggttg ttgacgaggc cgtcaaagtt 1680 
atcaagttca acttcgaaca ccacttgatc 1740 
ccaatcacag acgcggctct cgaggcttcc 1800 
gtcggaggtc ctaaatgggg tactggtcaa 1860 
agaaaagagc tcaacttgta cgccaacctg 192 0 
ttggacctgt cgcctctgaa gccggaaatt 1980 
gagcttgttg gaggaatcta cttcggtgag 2040 
tccgacactg aggcctactc cgtgcccgaa 2100 
atggccctgc aaagtgaccc ccctctccca 2160 
gcttcgtccc gtctgtggag aaagaccgtt 2220 
ctgaagctgc aacaccatct gatcgactca 22 8.0 
aaactgaacg gtgttgttct cacatccaac 2340 
tcggtgattc ccggttcgct gggcctgctg 24 00 
tccaacgagg cgttcggtct gtacgagcct 2460 
ggactggtga acccgctggc taccattctc 252 0 
aaccttgttg aggagggccg tgccgtcgaa 2580 
atcatgactg cagacttggg cggatcgtcg 264 0 
aaggaagtga ccaaattgct gggctaaagg 2700 
gttccatgca ctcgtcgagc ttccatgcag 2760 
tcccctcaat atactggtgg ctcgccctat 2820 
cctaggcata cccgaatttc ttttgctccc 2880 
atagaaaaaa ttacaaacaa tagcggctcc 294 0 
gacttctatt cgaggtttgg tgatcctgtt 3000 
tcgtgaaatt ttcagacccg tagttcaacc 3060 
tctgccaatc gcctcggcac aaactactcc 3120 
tgcaagcact ctaggtgttt cagaattggc 3180 
gcagcaaaac aagcaaaaca agcaaaaaaa 324 0 
ggtgatagca tcacgtgtct ttgaaataac 3300 
ctgtatttct ctggctccat cctgttccac 3360 
cgcactcagt ctgctgttga tctcgccatc 3420 
ctactcctcg tcttccgtcg actccctttc 3480 
gttgcattgc aagtcgcaat gctccggtag 3540 
gtgctattgc ggcgactcag taccttctgg 3600 
cgcctgtgac gggtacggct cggaaaactg 366 0 
tgactccgac caagaaaacg actcgagctc 372 0 
cacaagctcc acaagctcca ccagctccag 3780 
aagctcaaca agctcaacga gctcgtccac 384 0 
aacttcgacg tcgagctcct ccgcttcgtc 3900 
ggaaacctcc tccgagacaa ccagcacccc 3960 
gtcatcgtca tccgcatcgt cctcttcgtc 4020 
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gtcctctacg 
aacctcagcc 
ctctgtttcc 
ttccagcggg 
ctcaaaaaaa 
cggagtcat t 
gaaaagatcc 
ttcgccgaga 
cttcctcagc 
ttcgctgcca 
aggtcttagg 
tttacggtcg 
acatccttcc 
cttggtggcg 
caagcctcga 
cgctcaaaaa 
cactcgatcg 
ctcaaaaact 
gtaaatcttc 
gcgtgcaaaa 
gcaatgtcct 
acacagccgg 
agattgttca 
gattgcatca 
gacctcagtc 
cttcttctca 
gacatcaggt 



acgtcgacca 
acaacatcag 
cccggagcaa 
ctgccatcgc 
tcctcctcgg 
gtgggggtgg 
gacgaagacg 
aacaccgttt 
gaagacgacc 
aacgcagcgg 
gttatcaatc 
agttcgtccc 
gaacactgtg 
ttctggagtg 
atagtataat 
caggtccaat 
agttcaaatc 
tgacccgctc 
caacaccgaa 
catatgatgg 
tcaacgccat 
ggtaccggcg 
tcaatttgtt 
atctggcagc 
ctctttcctg 
cgtaacaagc 
caatgttgca 



cgtcttcatc 
ccacaccaac 
ccatgacgtc 
cctcttcatc 
gctccaagct 
cagcgctcat 
aagagtccat 
ctggcctcac 
gactggacca 
ccggagcccc 
cagacctcag 
gttaattagc 
tattttaaac 
cttcttctta 
ttggggagat 
ctccgtactt 
aataccctcg 
ggtcagcgct 
tctgatcgaa 
ctccaacgag 
caacagcgac 
tgttttggaa 
cgacaaccgg 
ctctccaaat 
tcctcctccg 
acccactccc 
cttgttcaca 



ctcgtcctcg 
aacaacctct 
tctgatctac 
ggcaagcacg 
tagcggagga 
cgcagccgtc 
caacgaaaaa 
ggccgccgca 
cccgggagca 
cggtgaccaa 
tgacgaagaa 
atatccctcc 
cggcctgtct 
cgtctttcaa 
ccagtcacaa 
tgggggtgtc 
ttcaccatct 
ttcacaacgt 
gagtgggcca 
gcggaagtac 
tcaccctcca 
ccgttcagtt 
gtgatatgct 
ccacaaacaa 
ttgatcaatg 
atcggcccgt 
tcgat 



atctcctcta 
ccggctgtgc 
atcacccagt 
ggcaacagct 
gctatagccg 
ctcttttttc 
gacgttccgg 
atcggagtca 
aaccgaagat 
tcgcgcaagt 
tgaaaaattg 
ttttgtatgg 
gcttgttagc 
tataatcttg 
gagtacaata 
cttaacctta 
cccacagcgg 
aatcaacctc 
aagcgtcgtc 
atgccgaccc 
cgtacgcgaa 
gcgtgtgttc 
cgtggtcggc 
gcggaggagc 
ggtccagccg 
aaatcttgtg 



catcagccac 
tcgtttccac 
cgatctccgc 
cggcccacga 
gaatagtcat 
tgtggtacaa 
agatgctcaa 
ataagttcgg 
tcagcgacgg 
ccaaagccgg 
tttcttgtct 
ttattcccca 
gtctaggcta 
actacttatt 
tacacaaaaa 
atgaccagcc 
actcaattct 
ctcctcggtg 
cgcacccaac 
cgaggaaaga 
cgaaacgttc 
catggccaga 
gtcgtactcg 
cagagttccc 
gacacggggt 
cgacgaaatg 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5615 



<210> 108 
<211> 363 
<212> PRT 

<213> Ogataea minuta 



<400> 108 
Met Thr Thr Lys 
1 

Glu Val Val Asp 
20 

Lys Pro Glu lie 
35 

Ala lie Asp Ala 
50 

Ser Lys Lys Ala 
65 

Trp Gly Thr Gly 



Lys Glu Leu Asn 
100 

Asp Ala Leu Leu 
115 



Asn lie Val Leu 
5 

Glu Ala Val Lys 



Lys Phe Asn Phe 
40 

Thr Gly Gin Pro 
55 

Asp Ala Val Leu 
70 

Gin Val Arg Pro 
85 

Leu Tyr Ala Asn 



Asp Leu Ser Pro 
120 



Leu Pro Gly Asp 
10 

Val Leu Asn Ala 
25 

Glu His His Leu 



lie Thr Asp Ala 
60 

Leu Gly Ser Val 
75 

Glu Gin Gly Leu 
90 

Leu Arg Pro Cys 
105 

Leu Lys Pro Glu 



His Val Gly Pro 
15 

lie Ser Ala Ala 
30 

He Gly Gly Ala 
45 

Ala Leu Glu Ala 



Gly Gly Pro Lys 
80 

Leu Lys He Arg 
95 

Ser Phe Ala Ser 
110 

He Val Arg Gly 
125 
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Thr Asp Phe Val Val Val Arg Glu Leu Val Gly Gly lie Tyr Phe Gly 
130 135 140 

Glu Arg Lys Glu Asp Asp Gly Ser Gly Phe Ala Ser Asp Thr Glu Ala 
145 150 155 160 

Tyr Ser Val Pro Glu Val Gin Arg lie Thr Arg Met Ala Ala Phe Met 
165 170 175 

Ala Leu Gin Ser Asp Pro Pro Leu Pro Val Tyr Ser Leu Asp Lys Ala 
180 185 190 

Asn Val Leu Ala Ser Ser Arg Leu Trp Arg Lys Thr Val Glu Glu Thr 
195 200 205 

lie Lys Asn Glu Phe Pro Gin Leu Lys Leu Gin His His Leu lie Asp 
210 215 220 

Ser Ala Ala Met He Leu Val Lys Ser Pro Thr Lys Leu Asn Gly Val 
225 230 235 240 

Val Leu Thr Ser Asn Met Phe Gly Asp He He Ser Asp Glu Ala Ser 
245 250 255 

Val He Pro Gly Ser Leu Gly Leu Leu Pro Ser Ala Ser Leu Ala Ser 
260 265 270 

Leu Pro Asp Ser Asn Glu Ala Phe Gly Leu Tyr Glu Pro Cys His Gly 
275 280 285 

Ser Ala Pro Asp Leu Ala Lys Gly Leu Val Asn Pro Leu Ala Thr He 
290 295 300 

Leu Ser Ala Ala Met Met Leu Lys Leu Ser Leu Asn Leu Val Glu Glu 
305 310 315 320 

Gly Arg Ala Val Glu Lys Ala Val Arg Ala Val Leu Asp Gin Gly He 
325 330 335 

Met Thr Ala Asp Leu Gly Gly Ser Ser Ser Thr Thr Glu Val Gly Asp 
340 345 350 

Ala Val Ala Lys Glu Val Thr Lys Leu Leu Gly 
355 360 



<210> 109 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DL5 



<400> 109 

caggagctac agagtcatcg 



20 
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<210> 110 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DL3 

<400> 110 

acgagggaca ggttgctcgc 2 0 



<210> 111 

<211> 8 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 111 

Asp Thr Gly Ser Ser Asp Leu Trp 
1 5 



<210> 112 

<211> 8 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 112 

Phe Gly Ala lie Asp His Ala Lys 
1 5 



<210> 113 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer PLE5 for amplification of Ogataea minuta YPS1 gene 

<220> 

<221> modi f ied_base 

<222> (6) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (12) 

<223> a, c, g or t 

<220> 

<221> modif ied_base 

<222> (15) 

<223> a, c, g or t 
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<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, gort 

<400> 113 

gayacngght cntcngayyt ntgg 24 



<210> 114 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer PLE3 for amplification of Ogataea minuta YPS1 gene 

<220> 

<221> modif ied_base 
<222> (9) 

<223> a, c, g or t 
<220> 

<221> modif ied_base 

<222> (21) 

<223> a, c, g or t 

<400> 114 

ttygghgcna tygaycaygc naa 2 3 



<210> 115 
<211> 3661 
<212> DNA 

<213> Ogataea minuta 



<400> 115 

gaattcacca 

tgccaggatt 

ctctctagtt 

ggtaagctca 

ctgcaatttc 

ccgcctatag 

aaacagattt 

aaagaggctc 

ttgatacaag 

tggatctcaa 

tcaataaacc 

aggcgaaccg 

gctcagattc 

aaacattaga 

acgcggcaat 

cgttggaggt 

ggaacgaact 

cttacttggg 

gcgatttcac 



gttatctgga 
attgctcatt 
tcattgcact 
actggacaac 
attctgtgtt 
ttctgggttg 
tccgggtgaa 
gctttgtgga 
ttggttgatt 
attaagttat 
atcagtgatg 
gaggtcgagg 
agacagagag 
gccagaggcc 
gacgcaggat 
gacgaggctt 
gcagggcgcg 
caagatgttg 
cgagaactcg 



cgaggcttgt 
gattgggctt 
gcccagtttt 
ggaaaacgag 
tatctgcccc 
ccatcacatc 
gtgtccatac 
tatatgattt 
tttcaaggct 
caaaagctac 
agcgacgcac 
aagaagcgga 
gaggaagtga 
atggatctgg 
ttggacaaga 
gggcagaccg 
cagtccaagg 
gggctctaca 
ctgtccatgt 



gtttcagacg 
tcttccaact 
atcaaatacc 
ctgccgtttg 
atctccaaag 
atatcgaagg 
tgtccaatga 
gaaagattac 
gtgtaggaaa 
gtaggggctg 
agataagaaa 
gaaccgagga 
aggagagcgt 
ctatcgatca 
ccagactgaa 
ttgactccgg 
ttgagggggc 
gtgacgactt 
tggcgcagct 



agttgctgta 
ctcccctgta 
acaggatatc 
aaatcaatct 
aggagactac 
agagcgctga 
cttggtatga 
agcgattgta 
atttggagta 
gccgagacga 
cggagcgcag 
tttctcgtcg 
tgaagctacc 
gctgaatgtt 
cttacgccgt 
acgcgctgca 
ccgtaatgag 
ggatgctctc 
actgaaaaac 



cagtcaaatt 
caacaccctt 
caagctttct 
gccactgttt 
tcctacgaat 
caagctattg 
agatcgtctc 
agacggttat 
aaaaaaattt 
cagcactgaa 
aagagcaaga 
tcttctgaga 
gaggaggttg 
acaggtgccg 
cttgatgccc 
aagttgggcg 
ctgagaaatg 
aggcagcaga 
agcggtaatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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tgtttgatga cgaggcgctg aagtcattag ttgaatagaa aacaggcaaa taattttggc 1200 
agggccgttt tgccgatgcg atataggctc tgttgccgat acgttcccgg ggagcttccc 1260 
tacggttgct gttctgtcgg tcttggcgag ttttccactt ttgcggccgc acgaagccca 1320 
gactagccag tcataccagc cgtggactcc gcctacttga cggggaaatt tttcccgtgc 1380 
cacttttccc ggggcaaaat aagtggctaa gcagcagaca agaaaaaaag gctcgaaaaa 144 0 
gttaaaagaa gtaacagcag aatatatata gccaagtgtg gtttgtcaga agcaaagcac 1500 
gctaatttga agcattttca cgggtgaaca gcacacaaag atctccaggg gggcgttctg 1560 
gttgtgaatt ttatatagag agcaaaaagg atttagaaat cgccgaaatt tgtttggttt 1620 
agaagtgctt ttattgtgag acgttttcgt gtatcagaag ggcatcttga cactcggtta 1680 
gaatatgagg tgcaaaaaca ttttggaagc aatgttgttg gtggccgggg gcacagtccc 174 0 
cgtggcgggg ttgcctgctg gcgagtcgaa ggcaaactcg agtccggggt atctgcgaat 1800 
ggaggccgag atctacagag ggcattcgtt tgagacgtcc caacgcggag gacggccgta 1860 
tatgctggag aagcgagccg aggacggatc ggtgctaatg gagctgcaga acaaccaatc 1920 
attttacaaa gtggagcttg aagtgggttc agacaagcaa aagattggtg tcttagtgga 1980 
tacgggttcg tcagatctgt ggatcatgaa ccaaaacaac tcgtactgtg agtcctcgtc 2040 
ctcgtcctcc aagatgcggg aacgcaaggg cagaaagctg agtgatctca gaaacctgaa 2100 
cttagacgtg agcgaaaaaa acgtgaaggc tgtcggggct gcagagactg aaacgatgac 216 0 
cttatcggtg ggagaaggtc tgttttcctg gttcgaaact cagacggacg gcagcggggg 222 0 
agaaacagaa acggcttccg gagacagctc cgaggccacc attgattgct ctgtttacgg 228 0 
gacattcgac ccgtcctcct ccgatacgtt caaatcgaac ggaacggagt tttcgatttc 234 0 
atacgccgat gacagtttcg ccaagggaac atggggcacc gacgatgtga ccttcaacgg 24 00 
tgtcacggtg gatcaattgt ctatggcaat tgctgatgag accaactcgt cgatgggagt 2460 
tcttggaatt ggactcaagg gcctggagac tacgtactcc ggagacgtga cgaatgcgta 2520 
cacgtacgaa aacttgccgt acaagatgca gtcccaggga ctgatcagca agccggtcta 2580 
ctcggtttat ttgaacgaca gcgagtccag cgctgcgtcg attttgtttg gagccgttga 2640 
ccacgacaag tacactggaa cgttgacgtt gctcccgatc atcaacacgg ccgaaagcct 2700 
gggctactcg acccccgtca gactcgaggt gacactgtca aagctttaca cgggctcgtc 2760 
ctcgaataaa acggccgtga gcatcgcgtc tggggctgcg gcagctctgt tggacacggg 282 0 
aaccacgttg acgtacgttc cttcggacat catctctaca atcgtggacc agtacggctt 2880 
tcaatacagc agttcggttg gaacgtatgt ggccaagtgc gactcgctcg acgatgctga 294 0 
gattgtcttt gacttccagg gaaccaagat atgggttccg ttctcgtcgt ttgcggtctc 3000 
actcaccacc aacggaggct cgcagtcgtc gtactgtgcg cttggcttga tggacagcgg 3 060 
agacgacacc ttcactctgg gagactcgtt cctcaacaac gtctacttcg ttgccgatct 3120 
agagaacctg cagattgcca ttgctccggc taacctggac tccacgtcgg aggacattga 3180 
agtggtgagc gactcgggaa tcccgtctgc aaagtccgct tctgcctact cttccagttg 3240 
gggtgcgtct ggctccgcgg tggcctcgtc gttgtctgtt caaaccggcg cagaaaccgt 33 00 
cacctccacc gatgctggct ccgactccac gggatctgcg tctgggtcgt ccggttcggc 3360 
ctcgtcctcc tcgtccaagt cttctgcgtc ctcctcgtct ggttcgtccg gctcgtcgtc 3420 
caagtcgggc tcgagctcgt ccaagtacgc tgccggaaac gcctggggaa tgagcgtctg 34 8 0 
cagcctggct ttcaccatcg cggtctcggt gttggtgatt ggctaacctg gccgcagccg 354 0 
ctttgcttcc atcctgctga ccccgccggt aactctggtc ggattgtatt acatacatac 3600 
atacctccca cgcgtttgat atcacgatgt gacttatttt tctgtgcaca gcccggaatt 3660 
c 3661 



<210> 116 
<211> 604 
<212> PRT 

<213> Ogataea minuta 
<400> 116 

Met Leu Leu Val Ala Gly Gly Thr Val Pro Val Ala Gly Leu Pro Ala 
15 10 15 



Gly Glu Ser Lys Ala Asn Ser Ser Pro Gly Tyr Leu Arg Met Glu Ala 
20 25 30 
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Glu lie Tyr Arg Gly His Ser Phe Glu Thr Ser Gin Arg Gly Gly Arg 
35 40 45 

Pro Tyr Met Leu Glu Lys Arg Ala Glu Asp Gly Ser Val Leu Met Glu 
50 55 60 

Leu Gin Asn Asn Gin Ser Phe Tyr Lys Val Glu Leu Glu Val Gly Ser 
65 70 75 80 

Asp Lys Gin Lys lie Gly Val Leu Val Asp Thr Gly Ser Ser Asp Leu 
85 90 95 

Trp lie Met Asn Gin Asn Asn Ser Tyr Cys Glu Ser Ser Ser Ser Ser 
100 105 110 

Ser Lys Met Arg Glu Arg Lys Gly Arg Lys Leu Ser Asp Leu Arg Asn 
115 120 125 

Leu Asn Leu Asp Val Ser Glu Lys Asn Val Lys Ala Val Gly Ala Ala 
130 135 140 

Glu Thr Glu Thr Met Thr Leu Ser Val Gly Glu Gly Leu Phe Ser Trp 
145 150 155 160 

Phe Glu Thr Gin Thr Asp Gly Ser Gly Gly Glu Thr Glu Thr Ala Ser 
165 170 175 

Gly Asp Ser Ser Glu Ala Thr lie Asp Cys Ser Val Tyr Gly Thr Phe 
180 185 190 

Asp Pro Ser Ser Ser Asp Thr Phe Lys Ser Asn Gly Thr Glu Phe Ser 
195 200 205 

lie Ser Tyr Ala Asp Asp Ser Phe Ala Lys Gly Thr Trp Gly Thr Asp 
210 215 220 

Asp Val Thr Phe Asn Gly Val Thr Val Asp Gin Leu Ser Met Ala lie 
225 230 235 240 

Ala Asp Glu Thr Asn Ser Ser Met Gly Val Leu Gly lie Gly Leu Lys 
245 250 255 

Gly Leu Glu Thr Thr Tyr Ser Gly Asp Val Thr Asn Ala Tyr Thr Tyr 
260 265 270 

Glu Asn Leu Pro Tyr Lys Met Gin Ser Gin Gly Leu lie Ser Lys Pro 
275 280 285 

Val Tyr Ser Val Tyr Leu Asn Asp Ser Glu Ser Ser Ala Ala Ser lie 
290 295 300 

Leu Phe Gly Ala Val Asp His Asp Lys Tyr Thr Gly Thr Leu Thr Leu 
305 310 315 320 

Leu Pro lie lie Asn Thr Ala Glu Ser Leu Gly Tyr Ser Thr Pro Val 
325 330 335 
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Arg Leu Glu Val Thr Leu Ser Lys Leu Tyr Thr Gly Ser Ser Ser Asn 
340 345 350 

Lys Thr Ala Val Ser lie Ala Ser Gly Ala Ala Ala Ala Leu Leu Asp 
355 360 365 

Thr Gly Thr Thr Leu Thr Tyr Val Pro Ser Asp lie lie Ser Thr lie 
370 375 380 

Val Asp Gin Tyr Gly Phe Gin Tyr Ser Ser Ser Val Gly Thr Tyr Val 
385 390 395 400 

Ala Lys Cys Asp Ser Leu Asp Asp Ala Glu lie Val Phe Asp Phe Gin 
405 410 415 

Gly Thr Lys lie Trp Val Pro Phe Ser Ser Phe Ala Val Ser Leu Thr 
420 425 430 

Thr Asn Gly Gly Ser Gin Ser Ser Tyr Cys Ala Leu Gly Leu Met Asp 
435 440 445 

Ser Gly Asp Asp Thr Phe Thr Leu Gly Asp Ser Phe Leu Asn Asn Val 
450 455 460 

Tyr Phe Val Ala Asp Leu Glu Asn Leu Gin lie Ala lie Ala Pro Ala 
465 470 475 480 

Asn Leu Asp Ser Thr Ser Glu Asp lie Glu Val Val Ser Asp Ser Gly 
485 490 495 

lie Pro Ser Ala Lys Ser Ala Ser Ala Tyr Ser Ser Ser Trp Gly Ala 
500 505 510 

Ser Gly Ser Ala Val Ala Ser Ser Leu Ser Val Gin Thr Gly Ala Glu 
515 520 525 

Thr Val Thr Ser Thr Asp Ala Gly Ser Asp Ser Thr Gly Ser Ala Ser 
530 535 540 

Gly Ser Ser Gly Ser Ala Ser Ser Ser Ser Ser Lys Ser Ser Ala Ser 
545 550 555 560 

Ser Ser Ser Gly Ser Ser Gly Ser Ser Ser Lys Ser Gly Ser Ser Ser 
565 570 575 

Ser Lys Tyr Ala Ala Gly Asn Ala Trp Gly Met Ser Val Cys Ser Leu 
580 585 590 

Ala Phe Thr He Ala Val Ser Val Leu Val He Gly 
595 600 



<210> 117 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DY5 

<400> 117 

ctcaagggcc tggagactac g 



<210> 118 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer DY3 

<400> 118 

cgggattccc gagtcgctca cc 



<210> 119 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PDI5 for amplification of 5* -region of 
Saccharomyces cerevisiae PDI gene 

<400> 119 

tctagaatga agttttctgc tggtgccgtc ctg 

<210> 120 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer PDI3 for amplification of 3 ' -region of 
Saccharomyces cerevisiae PDI gene 

<400> 120 

ggatccttac aattcatcgt gaatggcatc ttc 



<210> 121 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 
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<400> 121 
His Asp Glu Leu 
1 



